perl - Extract sequence by ID
I want to search a multi-FASTA file like this:
>ncliv_004380 | neospora caninum | cathepsin l, related | genomic | ncliv_chrib reverse | (genestart+0 geneend+0) | length=2793
atggacaacagtgagacgcactacgtctccttcctcaacggcgagggcgacgacggattg
gagaacggcgagctccaccagcgacgaggcgtccgagccggcggcgtggctgcaactccc
tacgtagtaacgactcggacgtacttttggaagaaattcctgcgtcagcgcaactttaaa
actcgggcctggatcgcactcgtagcagcggctgtgtctctccttgtctttgcctccttc
ctcattcagtggcagggagatgacgatcggggtgttttcccgccgtcaccagtcgaggac
cacaaaaccccggtgaacatctgggagtggaaagaagaacacttccagaacgccttcggc
>ncliv_004381 | neospora caninum | cathepsin l, related | genomic | ncliv_chrib reverse | (genestart+0 geneend+0) | length=2793
atggacaacagtgagacgcactacgtctccttcctcaacggcgagggcgacgacggattg
gagaacggcgagctccaccagcgacgaggcgtccgagccggcggcgtggctgcaactccc
tacgtagtaacgactcggacgtacttttggaagaaattcctgcgtcagcgcaactttaaa
actcgggcctggatcgcactcgtagcagcggctgtgtctctccttgtctttgcctccttc
ctcattcagtggcagggagatgacgatcggggtgttttcccgccgtcaccagtcgaggac
cacaaaaccccggtgaacatctgggagtggaaagaagaacacttccagaacgccttcggc
>ncliv_004382 | neospora caninum | cathepsin l, related | genomic | ncliv_chrib reverse | (genestart+0 geneend+0) | length=2793
atggacaacagtgagacgcactacgtctccttcctcaacggcgagggcgacgacggattg
gagaacggcgagctccaccagcgacgaggcgtccgagccggcggcgtggctgcaactccc
tacgtagtaacgactcggacgtacttttggaagaaattcctgcgtcagcgcaactttaaa
actcgggcctggatcgcactcgtagcagcggctgtgtctctccttgtctttgcctccttc
ctcattcagtggcagggagatgacgatcggggtgttttcccgccgtcaccagtcgaggac
cacaaaaccccggtgaacatctgggagtggaaagaagaacacttccagaacgccttcggc
and the IDs are in a file like this:
ncliv_004381
ncliv_004382
I want to split the sequences in the multi-FASTA file according to these IDs and save them to files. There should be two output files: one contains the sequences whose IDs are listed
>ncliv_004381 | neospora caninum | cathepsin l, related | genomic | ncliv_chrib reverse | (genestart+0 geneend+0) | length=2793
atggacaacagtgagacgcactacgtctccttcctcaacggcgagggcgacgacggattg
gagaacggcgagctccaccagcgacgaggcgtccgagccggcggcgtggctgcaactccc
tacgtagtaacgactcggacgtacttttggaagaaattcctgcgtcagcgcaactttaaa
actcgggcctggatcgcactcgtagcagcggctgtgtctctccttgtctttgcctccttc
ctcattcagtggcagggagatgacgatcggggtgttttcccgccgtcaccagtcgaggac
cacaaaaccccggtgaacatctgggagtggaaagaagaacacttccagaacgccttcggc
>ncliv_004382 | neospora caninum | cathepsin l, related | genomic | ncliv_chrib reverse | (genestart+0 geneend+0) | length=2793
atggacaacagtgagacgcactacgtctccttcctcaacggcgagggcgacgacggattg
gagaacggcgagctccaccagcgacgaggcgtccgagccggcggcgtggctgcaactccc
tacgtagtaacgactcggacgtacttttggaagaaattcctgcgtcagcgcaactttaaa
actcgggcctggatcgcactcgtagcagcggctgtgtctctccttgtctttgcctccttc
ctcattcagtggcagggagatgacgatcggggtgttttcccgccgtcaccagtcgaggac
cacaaaaccccggtgaacatctgggagtggaaagaagaacacttccagaacgccttcggc
and the other contains the sequences whose IDs are not listed
>ncliv_004380 | neospora caninum | cathepsin l, related | genomic | ncliv_chrib reverse | (genestart+0 geneend+0) | length=2793
atggacaacagtgagacgcactacgtctccttcctcaacggcgagggcgacgacggattg
gagaacggcgagctccaccagcgacgaggcgtccgagccggcggcgtggctgcaactccc
tacgtagtaacgactcggacgtacttttggaagaaattcctgcgtcagcgcaactttaaa
actcgggcctggatcgcactcgtagcagcggctgtgtctctccttgtctttgcctccttc
ctcattcagtggcagggagatgacgatcggggtgttttcccgccgtcaccagtcgaggac
cacaaaaccccggtgaacatctgggagtggaaagaagaacacttccagaacgccttcggc
Any help is appreciated.
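For reference, the split I am after could be sketched roughly as follows. This is only an illustration of the logic, written in Python rather than Perl. The function names (`read_ids`, `record_id`, `split_fasta`, `split_files`) are made up for this sketch. It assumes the ID is the text between `>` and the first `|` or whitespace in each header, and that the ID file separates IDs by spaces or newlines.

```python
def read_ids(lines):
    """Collect IDs from an iterable of lines; any whitespace separates IDs."""
    ids = set()
    for line in lines:
        ids.update(line.split())
    return ids


def record_id(header):
    """Return the ID of a FASTA header line.

    Assumption: the ID is the first token after '>', cut at the first '|'.
    """
    tokens = header[1:].split()
    first = tokens[0] if tokens else ""
    return first.split("|", 1)[0]


def split_fasta(lines, wanted):
    """Route every FASTA record to 'matched' or 'unmatched' by its ID."""
    matched, unmatched = [], []
    target = None  # list receiving the current record; None before the first header
    for line in lines:
        if line.startswith(">"):
            # A new record starts here: choose its destination once.
            target = matched if record_id(line) in wanted else unmatched
        if target is not None and line.strip():
            target.append(line.rstrip("\n"))
    return matched, unmatched


def split_files(fasta_path, ids_path, matched_path, unmatched_path):
    """File-based wrapper; all four paths are supplied by the caller."""
    with open(ids_path) as fh:
        wanted = read_ids(fh)
    with open(fasta_path) as fh:
        matched, unmatched = split_fasta(fh, wanted)
    for path, records in ((matched_path, matched), (unmatched_path, unmatched)):
        with open(path, "w") as out:
            out.writelines(line + "\n" for line in records)
```

With placeholder file names, a call might look like `split_files("sequences.fasta", "ids.txt", "with_id.fasta", "without_id.fasta")`. The sketch keeps both result lists in memory, which is fine for small files but would need to be changed to write records as they are read for very large ones.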