001、
(base) [email protected]:/home/test2# ls a.fasta test.py (base) [email protected]:/home/test2# cat a.fasta ## 测试fasta文件 >gene2 myc AGCTGCCTAAGC GGCATAGCTAATCG >gene1 jun ACCGAATCGGAGCGATG GGCATTAAAGATCTAGCT >gene4 malat1 AGGCTAGCGAG GCGCGAG GATTAGGCG >gene3 jun ACCGAATCGG GGCATTAAAGATCTAGCT (base) [email protected]:/home/test2# cat test.py ## 测试程序 #!/usr/bin/python in_file = open("a.fasta", "r") dict1 = dict() for i in in_file: i = i.strip() if i.startswith(">"): key = i dict1[key] = [] else: dict1[key].append(i) for i in sorted(dict1.keys()): print(i) for j in dict1[i]: print(j) in_file.close() (base) [email protected]:/home/test2# python test.py ## 运行程序 >gene1 jun ACCGAATCGGAGCGATG GGCATTAAAGATCTAGCT >gene2 myc AGCTGCCTAAGC GGCATAGCTAATCG >gene3 jun ACCGAATCGG GGCATTAAAGATCTAGCT >gene4 malat1 AGGCTAGCGAG GCGCGAG GATTAGGCG
参考:https://www.jianshu.com/p/403a23fdd7bb