An update of PANM database- version 5.1 for filtering the contaminating fungal gene sequences from molluscan transcriptome data
Dae Kwon Song, Min Kyu Sang, Jie Eun Park, Jun Yang Jeong, Chan-Eui Hong, Yong Tae Kim, Hyeon Jun Shin, Ziwei Liu, Hyeok Lee, Hongray Howrelia Patnaik, Bharat Bhusan Patnaik, Yong Hun Jo, So Young Park, Se Won Kang and Yong Seok Lee
Korea Native Animal Resources Utilization Convergence Research Institute (KNAR), Soonchunhyang University, Asan, Chungnam, South Korea Research Support Center for Bio-Bigdata Analysis and Utilization of Biological Resources, Soonchunhyang University, Asa
The PANM database (Protostome DB) is a public repository of protein sequences available from the Protostomia group (includes Arthropoda, Mollusca, and Nematoda). The latest version of PANM DB v5.0 (released in 2022) contained 21,276,123 protein sequences that comprised 4% of the total NCBI nr protein data. In this study, an update of PANM DB, i.e., version 5.1, is presented that could accurately analyze the large-scale transcript-xome data of molluscs for the contaminating fungal gene sequences. This version can filter out the fungal genes, thereby restricting the annotation to molluscs-only sequences. Using the database, we confirmed 1,589,546 amino acid sequences from 32 fungal species, which can be essentially filtered from the unigenes of 20 species of endangered molluscs. In general, the updated version of PANM DB is expected to enhance the accuracy of bioinformatics analyses of invertebrate NGS data, providing a valuable resource for researchers. PANM version 5.1 can be downloaded for free at https://panm.sch.ac.kr/ for local BLAST analysis.
  
39-1-1-1-4.pdf (444.1K), Down : 40, 2023-05-03 16:56:47

   

사무국 & 편집국 : 충남 아산시 신창면 순천향로 22 자연과학대학 3317호 / Tel: 041-530-3040 / E-mail : malacol@naver.com