forked from BDI-pathogens/phyloscanner
-
Notifications
You must be signed in to change notification settings - Fork 0
/
Copy pathsingle_ref.fasta
178 lines (178 loc) · 8.85 KB
/
single_ref.fasta
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
>B.DE.04.HIV_DE_BID_V4131_2004.JQ403037
ATGGGTGCGAGAGCGTCAGTAATAAGCGGGGGAGAATTGGATAGATGGGA
AAAAATTCGGTTAAGGCCAGGGGGAAGCAAAAAATATAGACTAAAACATA
TAGTATGGGCAAGCAGGGAGCTAGAACGATTCGCAGTTAACCCTGGCCTG
TTAGAAACAGCGGAAGGCTGTAGACAAATATTGGGGCAGTTACAACCAGC
CCTTCAGACAGGATCAGAAGAACTTAAATCATTATTTAACACAGTAGCAA
CCCTCTATTGTGTGCATCAAAGGATAGAGGTAAAAGACACCAAGGAAGCT
TTAGATAAGATAGAGGAAGAGCAAAACAAAAGCAAGAAAAGGGCACAGCC
AGCAGCAGCTGGTGCAGGAAACACAGCAGCAGCTGACGCAGGAAACAACA
GCCAGGTCAGCCAAAATTACCCTATAGTGCAGAACCTACAGGGGCAAATG
GTACATCAGGCCCTATCACCTAGAACTTTAAATGCATGGGTAAAAGTAAT
AGAAGAGAAGAATTTCAGCCCAGAAGTGATACCCATGTTTTCAGCATTAG
CAGAAGGAGCCACCCCACAAGATTTAAACACCATGCTAAATACAGTGGGG
GGACATCAAGCAGCTATGCAAATGTTAAAAGAGACCATCAATGAGGAAGC
TGCAGAATGGGATAGATTGCATCCAGTGCATGCAGGGCCTGTTGCACCAG
GCCAGATGAGGGAACCAAGGGGAAGTGACATAGCAGGAACTACTAGTACC
CTTCAGGAACAAATAGCATGGATGACAAATAATCCACCTATCCCAGTAGG
AGAAATATATAAAAGATGGATAATCCTGGGGCTAAATAAAATAGTAAGAA
TGTATAGCCCTGTCAGCATCCTGGACATAAGACAAGGACCAAAGGAGCCC
TTTAGAGACTATGTAGATCGGTTCTATAAAACTCTAAGGGCCGAGCAAGC
TTCACAGGATGTAAAAAATTGGATGACAGAGACCTTGCTGGTCCAAAATG
CGAACCCAGATTGTAAGACTATTTTGAAGGCATTGGGACCAGCAGCTACA
CTAGAAGAAATGATGACAGCATGTCAGGGAGTGGGGGGACCCGGCCATAA
AGCAAGAGTTTTGGCCGAAGCAATGAGCCAAGTAACAAATTCAACTGCTG
TAATGATGCAGAGAGGTAATTTTAGGAACCAAAGAAAGCCTGTTAAGTGT
TTCAATTGTGGCAAAGAGGGGCACATAGCTAGAAATTGCAGGGCCCCTAG
GAAGAAGGGCTGTTGGAAATGTGGAAAGGACGGACATCAAATGAAGGATT
GTACCACAGAGAGACAGGCTAATTTTTTAGGGAAGATCTGGCCTTCCTAC
AAGGGAAGGCCAGGGAATTTTCTCCAGAGCAGACCAGAGCCAACAGCCCC
ACCAGAGGAGAGCTTCAGGTTTGGGGAAGAGACAGCAACGCCTCCTCAGA
AGCAGGAGCCAATAGACAAGGAACTGTATCCTTTAGCTTCCCTCAAATCA
CTCTTTGGCAACGACCCCTCGTCACAATAAAGGTAGGGGGGCAACTAAAG
GAAGCCCTATTAGATACAGGGGCAGATGATACAGTATTAGAAGAAATAAA
TTTGCCAGGAAGATGGAAACCAAAAATGATAGGGGGAATTGGAGGTTTTA
TCAAAGTAAGACAGTATGATCAGATACTCATAGAAATCTGTGGACATAAA
GCTATCGGTACAGTATTAGTAGGACCTACACCTGTCAACATAATTGGAAG
AAATTTGTTGACTCAGATTGGATGCACTTTAAATTTTCCCATTAGTCCTA
TTGAAACTGTACCAGTAAAATTAAAGCCAGGAATGGATGGCCCAAAAGTT
AAACAGTGGCCATTGACAGAAGAAAAAATAAAAGCATTAGTAGAAATTTG
TACAGAAATGGAAAAGGAAGGGAAAATTTCAAAAATTGGGCCTGAAAACC
CATACAATACTCCAATATTTGCTATAAAGAAAAAAGACAGTACTAAATGG
AGAAAATTAGTAGATTTCAGAGAACTTAATAAGAAAACTCAAGACTTCTG
GGAAATTCAATTAGGAATACCACACCCGGCAGGGTTAAAAAAGAAAAAAT
CAGTAACAGTACTAGATGTAGGTGATGCATATTTTTCAGTTCCCTTAGAT
GAAGATTTCAGGAAGTATACTGCATTCACCATACCTAGTATAAACAATGA
GACACCAGGGGCTAGATATCAGTACAATGTGCTTCCACAGGGATGGAAAG
GATCACCAGCAATATTTCAATATAGCATGACAAAAATCTTAGAGCCCTTT
AGAAAACAAAATCCAGACATAGTTATCTATCAATACATGGATGATTTATA
TGTAGGATCTGACTTAGAAATAGGGCAGCATAGAACAAAAATAGAAGAAC
TGAGACAACATCTGTTGAGGTGGGGATTCACCACACCAGACAAAAAACAT
CAGAAAGAACCTCCTTTCCTTTGGATGGGTTATGAACTCCATCCTGATAA
ATGGACAGTACAGCCTATAGAGCTGCCAGAAAAGGACAGCTGGACTGTCA
ATGACATACAGAAGTTAGTGGGAAAATTGAATTGGGCAAGTCAGATTTAT
CCAGGGATTAAAGTAAGGCAATTATGTAAACTCCTTAGGGGAACCAAAGC
ACTGACAGAAGTAGTACCACTAACAGAAGAAGCAGAGCTAGAACTGGCAG
AAAACAGGGAAATTCTAAAAGAACCAGTACATGGAGTGTATTATGACCCA
TCAAAAGACTTAGTAGCAGAAATACAGAAGCAGGGGCAAGGTCAATGGAC
ATATCAAATTTATCAAGAACCATTTAAAAATCTGAAAACAGGAAAATATG
CAAGAATGAGGGGTGCCCACACTAATGATGTAAAACAGTTAACAGAGGCA
GTGCAAAAAATAGCTACAGAAAGCATAGTAATATGGGGAAAGACTCCTAA
ATTTAAACTACCCATACAGAAAGAGACATGGGAAGCATGGTGGATGGAGT
ATTGGCAAGCCACCTGGATTCCTGAGTGGGAGTTTGTCAATACCCCTCCA
TTAGTGAAATTATGGTACCAGTTAGAGAAAGAACCCATAGTAGGAGCAGA
AACTTTCTATGTAGATGGAGCAGCTAATAGAGAGACTAAATTAGGAAAAG
CAGGATATGTTACTGACAGAGGAAGACAAAAGGTTGTCTCCCTAGCTGAC
ACAACAAATCAGAAGACTGAGTTACAAGCAATTCATCTAGCCTTGCAAGA
TTCGGGATTAGAAGTAAACATAGTAACAGACTCACAATATGCATTAGGAA
TCATTCAAGCACAACCAGATAAAAGTGAATCAGAGTTAGTCAATCAAATA
ATAGAGCAGTTAATAAAAAAGGAAAAAATCTACCTGGCATGGGTACCAGC
ACACAAAGGAATTGGAGGAAATGAACAAGTAGATAAATTAGTCAGTTCTG
GAATCAGGAAAGTACTATTTTTGGATGGAATAGATAAGGCCCAAGAAGAA
CATGAGAAATATCACAGTAATTGGAGAGCAATGGCTAGTGATTTTAATCT
ACCACCTGTAGTAGCAAAAGAAATAGTAGCCAGCTGTGATAAATGTCAGC
TAAAAGGAGAAGCCATGCATGGACAAGTAGACTGTAGCCCAGGAATATGG
CAATTAGATTGTACACATCTAGAAGGAAAAATTATCCTGGTAGCAGTTCA
TGTAGCCAGTGGATATATAGAAGCAGAAGTTATTCCAGCAGAAACAGGGC
AAGAAACAGCATACTTTCTCTTAAAGTTAGCAGGAAGATGGCCAGTAAGA
ACAGTACATACAGATAATGGCAGCAATTTCACCAGCAATGCGGTTAAGGC
CGCCTGTTGGTGGGCAGGGATTAAGCAGGAATTTGGCATTCCCTACAATC
CCCAAAGTCAAGGAGTAGTAGAATCCATGAATAATGAATTAAAGAAAATT
ATAGGACAGGTAAGAGATCAGGCTGAACATCTTAAGACAGCAGTACAAAT
GGCAGTATTCATCCACAATTTTAAAAAGAAAGGGGGGATTGGGGGATACA
GTGCAGGGGAAAGAATAATAGACATAATAGCAACAGACATACAAACTAGA
GAATTACAAAAACAAATTACAAAAATTCAAAATTTTCGGGTTTATTACAG
GGACAGCAGAGATCCACTTTGGAAAGGACCAGCAAAGCTTCTCTGGAAAG
GTGAAGGGGCAGTAGTAATACAAGATAATAGTGACATAAAAGTAGTGCCA
AGAAGAAAAGTAAAGATCATTAGAGATTATGGAAAACAGATGGCAGGTGA
AGATTGTATGGCAAGTAGACAGGATGAGGATTAGCACATGGAAAAGTTTA
GTAAAACACCATATGCATGTTTCAAAAAGAGCTCAGGGATGGTTTTATAG
ACATCACTATGAAAGCAATCACCCAAGAATAAGTTCAGAAGTACACATCC
CACTAGGGGATGCTAAATTGGTAGTAACAACATATTGGGGTCTGCATACA
GGAGAAAGAGATTGGCATTTGGGCCAGGGAGTCTCCATAGAATGGAGGAA
AAGGAGATATAGCACACAAGTAGACCCTGGCCTAGCAGACCAACTAATTC
ATCTGTATTATTTTGATTGTTTTTCAGAATCTGCTATAAGACATGCCATA
TTAGGACGTATAGTTAGCCCTAGTTGTGAATATCCAGCAGGACATAACAA
GGTAGGAACTTTACAATACTTGGCACTAACAGCATTAGTAACACCAAAGA
AGATAAAGCCACCTTTGCCTAGTGTTAGGAAACTGACAGAGGACAGATGG
AACAAGCCCCGGAAGACCAAGGGCCACAGAGGGAGCCATACAATGAATGG
ACACTAGAGCTTTTAGAGGAGCTTAAGAGTGAAGCTGTCAGACATTTCCC
TAGGATATGGCTTCATAGCTTAGGACAACATATCTATGAGACTTATGGGG
ATACTTGGACAGGAGTGGAAGCCATAATAAGAATTTTGCAACAACTGCTG
TTTATTCATTTCAGAATTGGGTGCCAACATAGCAGAATAGGCATCACTCG
ACAGAGGAGAACAAGAAATGGAGCCAGTAGATCCTAGACTAGAGCCCTGG
AAGCATCCAGGGAGTCAGCCTAGGACTGCTTGTACCAATTGCTATTGTAA
AAAGTGCTGCTTTCATTGCCAAATGTGTTTCATGAAAAAAGGCTTAGGCA
TCTCCTATGGCAGGAAGAAGCGGAGACAGCGACGAGGATCTCCTCAAGAC
AGTCAGACTCATCAAATCTCTCTACCAAAGCAGTAAGTATATGTAATGCA
ACCTTTAGAAATAACAGCAATAGTAGCTTTAGTAGTAGCAATAATAATAG
CAATAGTTATATGGACCATAGTACTTATAGAATATAGAAGAATATTAAGA
CAAAGAAAAATAGACAGGTTAATTGAGAGGATAAGTGAAAGAGCAGAAGA
CAGTGGCAATGAAAGTGAAGGAGACCAAGAAGAGTTATCAGCACTTGTGG
TGGACATGGGGCATCATGCTCCTTGGGATGTTAATGATCTGTAGTGCAGC
AGGAAATCTGTGGGTCACAGTCTATTATGGGGTGCCTGTGTGGAAAGAAG
CAACCACCACTCTATTTTGTGCATCAGATGCTAAAGCATATAAGACGGAG
GTACATAATGTTTGGGCCACACATGCCTGTGTACCCACAGACCCCAACCC
ACAAGAAATAGCATTGGAAAATGTGACAGAAAGTTTTAATGTATGGAAAA
ATGACATGGTAGAACAGATGCAGGAGGATATAATTAGTTTATGGGATCAA
AGCCTAAAGCCATGTGTAAAATTGACCCCACTCTGTGTTACTTTAAATTG
CACTGACTGGGGGAACGATACTAGTACCTCTGGGAATGCTACTACTACCA
CTGCTGCTACTACCACTAGGAGTTGGGATATGATGGATAGAGGAGAAATA
AAAAATTGCACTTTCAATATCACCACAGAGATACAAGATAAGAAGCAGAA
AAAATATGCACTTTTTTATAGACTTGATATAGTACCAATAGATAATGATA
ATGCCAGTTATGGTAATACCAGTTATAGGTTGATAAATTGTAACACCTCA
GTCATTACACAAGCCTGTCCAAAAGTGTCCTTTGAGCCAATTCCTATACA
TTATTGTGCCCCGGCTGGTTTTGCGATTCTAAAATGTAGAGATAAGAATT
TCAATGGATCAGGAGTATGTGAAAATGTCAGCACAGTACAATGCACACAT
GGAATTAGACCAGTAGTATCAACTCAGCTGCTGTTAAATGGCAGTCTAGC
AGAAGAAGAGGTAGTGATTAGATCTGAAAATATCTCAGACAATGCTAAAA
CCATAATAGTACAGCTGAAGGAATATGTAAACATTAGTTGTATAAGACCC
CACAACAATACAAGACAAAGCATACATATAGGACCAGGGAGAGCATTTTA
TGCAACAGGAGACATAATAGGAGATATAAGACAAGCATTTTGTAACATTA
GTAGGGAGAAATGGAATTACACTTTACAACAGGTAGTTATAAAATTAAGA
GAACAGGAATTGTTTAGAAATAAAACAATAGCCTTTAAGCCATCCTCAGG
AGGGGACCCAGAAATTGTAATGCACAGTTTTAATTGTGGAGGGGAATTTT
TCTACTGTAATACAACACAGCTGTTTAATAGTACTTGGAATAGTACTTGG
ACTTTGAATGATACTAGAGGGTCAAATAACACAAATGGAACTGGTTATAT
CATACTCCCATGCAGATTAAAACAATTTATAAACATGTGGCAGAGAGTAG
GAAAAGCAATGTATGCCCCTCCCATCAGTGGACAAATTAACTGTACATCA
CACATTACAGGGCTGCTATTAACAAGAGATGGTGGTAATAACAATAGCGA
ACCCACCCAGATCTTCAGACCTGGAGGAGGAGATATGAGAGACAATTGGA
GAAGTGAATTATATAAATATAAAGTAGTAAAAATTGAACCATTAGGAGTA
GCACCCACAAAGGCAAAAAGAAGAGTGGTGCAGAGAGAAAAAAGAGCAGT
AGGACTGGGAGCTATGTTCCTTGGGTTCTTGGGAGCAGCAGGAAGCACTA
TGGGCGCAGCATCAGTGATGCTGACGGTACAGGCCAGACAGCTATTGTCT
GGCATAGTGCAACAGCAAAACAATTTGCTGAGAGCTATTGAGGCGCAACA
GCATCTGTTGCAACTCACAGTCTGGGGCATCAAGCAGCTCCAGGCAAGAC
TCCTGGCTGTGGAAAGATACCTAAAGGATCAACAGCTCCTAGGGATTTGG
GGTTGCTCTGGAAAACTCATCTGCACCACTACTGTGCCTTGGAATGCTAG
TTGGAGTAATAAATCTTTGGATATGATTTGGAATAATATGACCTGGATGC
AGTGGGAAAGAGAAATTAACAATTATACAGGCTTAATATACAACTTAATT
GAAGAATCGCAGAACCAACAAGAAAAGAATGAACTAGAATTATTAGAACT
GGACAAGTGGGACAGCTTGTGGAATTGGTTTGACATAACAAAATGGCTGT
GGTATATAAAAATATTCATAATGATAGTAGGGGGCTTGGTAGGTTTAAGA
ATAGTTTTTGCTGTACTTTCTATAGTGAATAGGGTTAGGCAGGGATATTC
ACCATTGTCGTTTCAGACCCGCTTCCCAGCACCGAGGGGACCCGACAGGC
CCGGAGAAACCGAAGAAGAAGGTGGAGAGAGAGACAGAGACAGATCCGAC
AGATTAGTGCACGGATTCTTGACACTTATCTGGGAGGATCTGAACAACCT
GTGCCTCTTCAGCTACCGCCACTTGAGAGACTTACTCTTGATTGCAGCGA
GGATTGTGGAAATTCTGGGACACAGGGGGTGGGAACTCATCAAGTATTGG
TGGAATCTCCTGCAGTATTGGAGTCAGGAACTAAAGAATAGTGCTGTAAG
CTTGCTTAACGCCACAGCTATAGCAGTAGCTGAGGGAACAGATAGGATCA
TAGAAGTAGCACAAAGAGCTTGGAGAGCTGTTCTCCACATACCTGTAAGA
ATAAGACAGGGCTTAGAAAGGGCTTTGCTATAAGATGGGTGGCAAGTGGT
CACGAAGTAGTCTAGTTGGATGGCCTGATGTAAGGGAAAGAATGAGACGA
GCTGAGCCAGCAGCAGATGGGGTGGGAGCAGCATCTCGAGACCTGGAAAA
ACATGGAGCACTCACAAGTAGTAATACACCAGCTACTAATGCTGATTGTG
CCTGGCTAGAAGCACAAGAGGAGCAGGAAGAGGTGGGTTTTCCAGTTAGA
CCTCAGGTACCTTTAAGACCAATGACTTACAAAGCAGCTCTTGATCTCAG
CCACTTTTTAAAAGAAAAGGGGGGACTGGAAGGGCTAGTTTGGTCCCAAA
AGAAACAAGATATCCTTGACCTTTGGGTCTACAACACACAAGGTTACTTC
CCTGATTGGCAGAACTACACACCAGGGCCAGGGATCAGATTGCCACTAAC
CTTTGGGTGGTGCTTCAAGCTAGTACCAGTTGAACCAGACAAGGTAGAAG
AGGCCAATGAAGGAGAGAACAACAGCTTGTTAAGCGCCATGAGCCAGCAT
GGAATGGAGGACCCAGAGAAAGAAGTGTTAATGTGGAAGTTTGACAGCCG
CCTAGCATTTCATCATGTAGCCCGAGAGAAGCATCCGGACTTTTACAAAG
ACTGCTGACAGCGAGACTACAAAGACTGCTGACATCGAGCTTTCAACAAG
GGACTTTCCGCTGGGGACTTTCCCGGGAGGCGTGGACTGGGCGGGACTGG
GGAGTGGCGAGCCCTCAGATGCTGCATATAAGCAGCTGCTTTTTGCC