-
Notifications
You must be signed in to change notification settings - Fork 50
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
The relationship between the location of Mutator transposons and exons #225
Labels
Comments
Could you provide an example with the alignment data? |
The file contains a pair of homologous genes and the sequence A is ancestral. We performed de novo library building with RepeatModeler for A and B, respectively, and then merged them. However, the annotation results show that only the exon on gene B is annotated to Mutator, and these exons are extremely similar to the ancestor with a very high degree of identity. Then why Mutator was not annotated on the ancestral sequence?
I also asked Prof. Damon Lisch about this question, and he believes that Mutator is not seen in these two sequences, so I would like to know more about what is the basis for RepeatMasker to recognize Mutator?
>A
TTATGCAAGTGGAACTCGCCTTCGATATTCCCCAAGAGCCCATAAGGGGAAAATGTCTCTAAATGCTGCGTAGTTTAATCCACAAAACGTAAAAAATATCCCAGTGATTTCCTGCACGTTTTGGCAATAAACAAACATTGGGACTTTAGAAATTGTTTTTATTGAAGGAAAAGGTAAAAAAAAAAAAACGAAAAAAAAAAACACTACTCAAGACAAACACTTGAATGAACTATTATATAAATAAATGTGCTTGATTTTGTATAAATTGCATGAATTACCTGTTGAGGAAATTCACCATCTTCCAATTGTGAGTTGATCAGCACCCTTACTCCACGATGAATTGGTGTTGGATCTCTCTCAGCCTGATTAATTTTGAGAGAAAAAAAAAAAAACTGTGATTTTTAAATCTCTTAAGAGAAAAATGTTATTTATCATCATTTTAACCATAATTCTTTTAATATTTTTTGAGGTGACATCAAATGATTGGTAGATACAGGTGAAATATAACAAATATTTTTTCAATCATTTAAAGCGATGTTATCGGATGATGAGATCAGAATGATAAATAGAAACGATGACGGGTAACATTACAGCTTATTAAAAAATGACCACAGACTGACTGACCTGTCCTGCGTTAATGAGCGTTAACACTGCCCAAGCAGTTTGGACAACATTTTCTCGGTTGCCTTCAATATTTGACCACACCTGTTCAACAATCAAACTTTATTTCTCTATTTTCATTTTCCTTTTTTTTTTTTTGTTTTTCCTTGTGGTTTACACAATAAAAAATGGAATATGTTGTCAAATATATATATATATATACCTTGTTGTGGCATGAAAGGTAACTCTCTCCCCATCCACCATTTGGCAACTGCTTCGACAGCAAAAAATCACAAGCTTTGCGAATTGCAGCACAATTTTGGTAGTTTCTTCCAGAGGCTGTTAGTGACCCTACAGCAAACCATGTGCCATAGGTGTAGCAAATCCCCCAATTACCATACCTGTGTAAGTGCTAATCTTGCATAAATATCGATATTGTTGCCTTGTTGCAAGGATACTAACGTGTGAAGAAAATTCTAAAAAAAAAAAGAAAGAAATTGAAAAATTGTATTTGCGAGCCGATATGAAAAACGCGCTGATCTCCAGACTAATGACATGACAAAACTTGATTTGCAAGATAAACTTAAAAACTATATATATCTCTTACAAATCAAAATCTCTCTTATTAGCAATGTAGAGACTGTAACGCTAACCACACCGGCTTGTGAATAAAAGTACTAAAAAGGTATTGGTGATTAAATTTAGGTTTGGGGAGGATCATGAGATGAAAATTTTTAATTTTAGATAAAAGTTTAAAATATTATTTTTTAATATTAATTATTGTTTTGAGATTTGAAAAAGTTGAATTGAGATTTGAAAAAGTTGAATTGTTTATTATATTTTGTATGAGAATTTGAAAAAATTGTAATGATGAGATGAAATAAGATGAGAATTTTGTGTTTTATATTTAGTACCAAACCTAACCTAATTCGCCTTTTTTTTAATAACTGGAATAGATCAAATTATCTCATAGTAAATTAATAACATATTTAGTTATAACAGAAAATGTTTCTTTCATGGGTTATAATATTTTTTTTGGCGAGTTTTATTCCATGAGTTTCCCACGACAAAATTTTGTAGAAATAGAGTTTTTTCCACAAAAGTCTAATTTTTGCCACAAAACTCATTCATGGAAAATTGCAGTTTCAGTTGTAGTGAACTAGCAAGTTAAAGTTAGTTAAGATAGAAAGATGATCTTGCATGGTGTCTTACCATGATCCATCAGGTTCTTGTATGTCTTGAATGAATTGAATGGCCTTGGAAATGGAATTGTCTATCTCCATCCGACGGTGCTTGGGATACAATTTCCTAAAGAGTACGAGACCTTCAACTGCTGACGCAGTGCACTCAACGTACCTATAATTAATTTTTTCATCATGTAATATATATGACCAATAATGTACGTGGGAATTGCATGCTAGCAGTTGTACATGAGACGTGATCTCATCTTACTCTTTTTCAAAGAGACAATCCTCGTAGACCTCAGTTGGGTTGAACTTCTGCAGAAAGATAAACAGAATTTATGTCAAGGCCGAGTACTGTTGAGTTAGCATAAAAATGCATGGTTTATTCTCATTAAGAATGAACAGATCACCTACCTGCATCCAGGGTGATGCTTTCACAGGCTCCCATGCTGAGAAACCACCATTACTATTCTGTAGTGGAAGTTGAAAGAACGTACGTGAGTTACTGATGAGGCAATGCATGCATAAGAAAAATGAACATAATGAAGTACATTTGCAAGCTGATGCGATGAACGTATTCTTTATGCGCCACCGATCATACGATATGATTTGATTTTTTTTATCATGTTATATATCGTAGAAGGTGTGTTCTTCACACCAGCTTGTAATTTTCCTATACGTTAACGGAGTGAAAAACAAAGTATGTGAGTTGAAGTCCTACTTGTAGAGAAAGAACGACATTCACTGCATCATAAAACCGTTCAGTCTCCATTTTTTCCCCAACTAGATCGGTGGGAAATTGTGATAACATGAGTGTAGCCTGCAGAAGAGAAATTGGTTTTGCAAAAAGTCAAATTCTTCGGATAATGTGCGTGGAATTTAGTATATATGTACGTATTTTGAGATCAGAATTCAGGCATTATATATACCTTCAACCCTTCTGCTGTGACATCAGAGACTTGCCAGCCATAGTCCTGTGTTGCTAGTGTCCATGATCCTTTAGTTATGTGTCGGTACATAGCCTTGAAGTCCCCAGGAGGGTTTTCTTGCACCTACATATAAGAATATTATTGATGCATTATTCTTATTAAAACCAACTCGTCAAACAATATTTAATAAAATAAGATGAAATATAGAGTCAAAGTTCTCTCGAACTCACTAAATTAGAGATAGGTATACTGTTGTAATTATGTTTTTCCTCCGTCATAGAACAGTCAATAAAATTTGGAGTGCCGTTCTCTTTAAAAAAAAAAAAAAAAGAAAAAAAAAGAGAGAGAGGAGATATTCTGATCATCACCTGTGAAGCCTTCATGAAATCATTTGCTTTTCGAAGAGTTGTCGCGCTCTCTTCGTTTAGATTACAAGGTAGGATTGCTTGAATAGCGAAAACTGCAGACCACGTTTGACAGCCCAAACTCTGCACCACAGGAATAATGTTGGATCAAATTCTGAGTAATTTAGTCTTCATGACTAGTGAGTAATTCTTCAAGTATTTGTGATTGCATGATCCGCACGATGGCTCACTTATTCATAGATCATGTATATACTTATGCATGCATGCTTTTGTTCTTGTTCCACAAAAGATCAGGCTAGGGAAGATAATATCACCTGAAGTTTTAAGCCATCTTCTGCAACCCAATAGTAGTCAGGAAGTCTGGCTAAATGACACTTGTATGCCTCTGAATCTGGATCTTCAACCCACTGGGCCATCAAGCATAACACCTTTCATAAAAATAAATTATAAATAAGAGTTGAGTTCTTAATGTGTATCTTCTTCATTGAGATAATTGAATTGCAGCAGCTAGATAGCACTAGTTATATATATGGACCTTTTCAACGCCTCCAATGCATAAATATCTGCTGGCCTCGTCCTCATAACGTATATGATCAAAGGCAATTTTCACCGCCTTCTTTCTCACCATTGAAAAGGGCCAAACTGACAGAAAAGGCTCCACTATGTATTGAAGAAAATCCCATGCCAAATCTTGTACCAGAGGATGTGGAAAGTAGAGATCCTCCTATATATATATATATATATATATATATATTAATTAGAATTTTCGATTAGTTCTTATTTAATAAAAAATTTTAAGAGAAATGCATTTTTTGAATGAATTTTTTTATCCCAAAAACTTCGAGAAGAAAATGATATTTTTGGGGTGAAAAATTTTGTAAAAAAAATGGAACTTTTATATATGTTGTAGTGTTGTACATATTTTTGCCAAAATGTTTTGAATATAATCATGATAAACAGTTTTGAGACAGAAAAGAAAGACCTAAAAGGACGGAGATTCATATATATATATATAGTACTAACCTTTGCAATTGTATTCCTGGCTTTGTTCCAGTTAACTTGTTCATAAGGCTCGTTGTACAACTCTTGTCTTAGCGATTTAACCAACTCAGTGATTGGACCAACAAATCTCTTCCCATATAAATAAGACATTGGCATGTAAACTAAGCGAGCATAGCATAACATTTTTCCTGCAATAATATTAATTCATATAATTAGGGCCATCTAATTATTAGTGTTCATGATCTTAATGTAATTTTTAAAAGCAAAAAAGAAAAAAGAAAGATATATATATATATATACACACACACAAGTGAGAAAATCAACCTGGATTAAGGGGGATGAAATTAGGAAGAAGCCAGAACTCTGGGGGTAATGGATTACATCCCGACCACTCATACGCTCCTAGTACCTATATATATATATATATATGATTAGAAGAGGCATTATTTCAGATTGAGAAATATTTTAGCTACAAATAGATTACACAAAAGTAATATCATAAACTGACATAGTTTCATGTAATTCGTTAGGTTGTAAAATTATTTTTATTATAAAATAGATTTAACATAGGGTGAAAAACAAATTCTATAGAATCCTTTAAAATCTAATCGACCCATAAAATTATATATAGGAACTCTTAACAATTGTTCATACCGAGACCCAAAACTTTCCCCATGATGGCATTGTCACCAAACCACCATGGTCGAGGATCCATTTTCGGCCTCTATCCATGGCCCTATCTTCACCATCTTCGAGCCCCTCTCCAAGTATCCTCAAGGCAATATAGCTCAAAGCTGAGCCAAACATTGTGCTGTCTCCCACTATGTGGAAACTCCATCCTCCATCTTCATTCTGTCCAACAAACACAGTCAAATACGTGTATATATATATATATATATGTTTAATATCTACCCAAAATATATCTCAGCTTCCTAGGCCTTTTATTGAATCCAAACCTGAGTATTATATAGGTATCGAATGATTTCCTTCCGATGATGTGATGAGAACATGCGATTGAGATCCCCAGTAATAGACAATGCCATCACCTGCAAATGGTAATGACTTCAAATCAAGTTATCTATAGCAAATAACTTACCAGGATAATTTAAATATAACAAAACATATATAATCATTATAAAATAATATTCTTACAACTATTTTATTTGAATAAAAAAATTAAAATGATGCCATACCAAGGGCCCAACAAAAACCAAGGGTCCACCAAATTCTGCAGGCCAGTGGCCATCATGGGCCTGAAGGGAGGAAATGGAGCTTAGTGCTCTTCTCAGTGTAGTTGTCACTGCTTCCTCTGTTATTTCCTCTGTTTCTTGGACTTTCACTGGTGGTGGAATTGGCCCACGTTGATTCTCCTTTCTAATCTGCAAGTTTCAAAAGCCAAAAATATAGTTGGAATTTCATAACTGATCAAAATTATTCTAATCATGCAATATCATTAGGGGTTAGAAAACCAACTATATATGAGTATATGTATGAAGAAAGGGATAATTATTATTTTTTAAAATCACTCAAATTTTCAATATTCTTCCTCAAATTTAAAATTTACGGCCTATTCATTGAGATTTGGTTATTAACATAAACAAAAAAATTAGTGTTACAAATAAAAAGAGATTATACAAAATTATCAAATCCACAAATTGACATAGTTTTATTTGATCCGTTAGATATATTTTATAATAAAAATCATTTTACAATCTGACGTACCGCATCAAATCACATCAATTTATAAATTTATTTTTACGTAATCTAAACTATTTCTCTTCTAATTGTTTTTTTTATTATTATTCATGGACAAAAAATGAAGTGACGTTATAGGTAATAAAATATTTTCAATTTTTTTTTTCACTTATTTGATTCCAGCTTGGAGGGGAATAAAGATGATTATCCTCGGAAATATAAATTGCCAGAAAAGAAGGGAGGGGTGCGAGGAGATGATCTTTGCATGTGAAGTCGTTAGGGCATGGGAAAACGTGAGTGTGGGAAGACGTGATCCAACCTTGAGTATGAGTGAGTTGGAAGTCTGTTCTTGCATGCTAATTCAATACTCTATTTCAAGCCATAAATACCCATCCAGATCACCTTTTTCTCTCTATCGTAGTTTGCGGACTTGCGTATGCATGGGACGAGGGAGATCATGAGATATGATAATTTTTTTTTTTGAGTTGCATGAGATATGATATTTGGTGTAAGTGTCGGCAAATACAGATCCACATAAAAAAAAAATAACTTTTTAATAGTTAAGATTTGGAGTGTCCCACTCTTTTTCAAAATAATTATGCAACTTTTATGTATTATATATACGACTTCACGTAGCCTTAGATTATGTTTGGAAGTTTCATCTTCAATAAAATTCTCATCTCATCTCATCTCATCATTACAACATTTTCAAATTCCTATATAAAATATAATAAATAATTCAAATTTTTCAAATCCCAATACAACTTTTTCAAATTTCAATTTAACTTTTTCAAATCTCAAAACTAAAAAATAATATTTTAAACTTTAAAACAAAACACAAAATTCTCATCTTACCTTCCAAACATAATCTTATTCTTTTTTAAAAATATTTATATAAAAAAAAAAAACACAGAAATCACTTCGTCGGTGCACGTAGCACCGTACGCATACCCTTTTCGTAACTATGAGCAAATCTTGCAACAATATTGAGGTTTGAGTTTTTCAGTCCGAAACTTTTTATCGTACTATAGATATCATGATGGTCTCATGATATGTATGTCTCGTCCCTAAGTCAAACGGAGTCGATCGCATTTTGGCTATAATACCATTGCATCGAATTAAGGCTATAAGGTTTCTCTCGAGGTTATAATTTTGGCTTCTCTTTTTCCTTTAATTAATTTATATGCATGGGTTGGTCGAATGCAATCACTTATCCAATTTACTATACATGCACACCGAAATACATTTGGTGCTGTAAATGCATGTGTGCATGCACTTCAAATATATAGTATCTCTAGTACTACTCTTATATCATGCATGCAGGTATTCAAGAAACTACTATATATAAGAGATGAGATCAGAAAAAGGAATTAAGGGAAAAAAGATCAAGAGAAAAACCTGCATTCTCATCAAAAGATCACAACTTTGTTTCATCTTAAACCGATTTTTCTTGTATTCCTCACGGACCCTTTCAACTTCAGCATGTTCTTCCGGTGTACCAGCATTAGGGTCGAATTCCCAGTGTTCTCGGCCGATGAAATTGTTTACGCTCACCAAATCGGGGCCTCCTTGGGACACTTTCAACTTCCACAT
>B
TAAATATATATATATATATATTTATATTGGAGACTTCAGTGCTGCTATCTGTCGAATTACTTCAATATGCAGAACTTTTGTTTTGCATTTCATGCACTATTATGCAAGTAGAACTCGCCTTCGATATTCCCCAAGAGCCCATAAGGGGAAAATGTCTCTAAATGCTGCGTAGTTTAAGCCACAAAACGTAAAAAATATCCCAGTGATTTCCTGCACGTTGGCAATAAACAAACATTGGGACTTTAGAAAATTGTTTTTATTGAAGGAAAAGGAAAAAGGAAAAAAAAGAAAAAGAAAAAGAAAATAAAACTCTACTCAAGACAAACACTTGAATGAAGTATTAAATAAATGTGCTTGATTTTGTATAAATTGCATGAATTACCTGTTGAGGGAATTCACCATCTTCCAATTGTGAGTTGATCAGTACCCTTACTCCACGATGAATTGGTGTTGGATCTCTCTCAGCCTGATTTTGAGAAAAAAAAAAAAACTGTGATTTTTAAATCTCTTAAGAGAAAAATGTTATTTATCATCATTTTAACTATCATTTTTTTAATATTTTTGGACGTGACATCAAATGATTGGTAGACCGGTAAAATATAACAAATACTTTTTCAATCATCTAAAGCGATCGACGTTATCGGATAATGAGATCAGAATGATAAATAGAAACGATGACGAGTAACATTACATCTTATTAGAAAATGACCATAGACTGACCTGTCCTGCGTTAACGAGTGTTAACACTGCCCAAGCAGTCTGGACAACATTTTCTCGGTTGCCTTCAATATTTGACCACACCTGTTCAACAATCAAACTTTATTTCTCTATTTTCATTTCCTTATTTTTGTTTTGTTTTTCCTTGTGGTTTACACATTAAAAAATGGAACAAGACGTTGTCCTATATATATATATATATGTATATAAATAATATATACCTTGTTGTGGCATGAAAGGTAACTCTCTCCCCATCCACCATTTGGCAATTGCTTTGACAGCAAAAAATCACAAGCTTTGCGAATTGCAGCACAATTTTGGTAGTTTTTTCCAGAGGCTGTTAGTGCCCCTACAGCAAACCATGTGCCATAGGTGTAGCAAATCCCCCAATTACCATACCTGTGTAAGTGCTAATCTTGCATAAATATCCATATTGTTTCCTTGTTGCAAGGATACTAACATGTGAAAAAAGTTATAAATATCGATATGAAAAACACGCTGATCTCCAGACTAATCATGACAATATGACATGAAGACTTGATTTGCAAGATAAATTTAAAAACTAGGGACTGGTTTGGTTACACAAAACTAAATCATTTTATTTCATAAAATCATTATAAAATTTTCAAACTCCCATATAAAATATAATAAAAAATTCAAAATTTTCAGATTTCAAAATAAAAATAATATTAAAAAATTTATATTATAATAATATTCTATTCAACTTTTAACAAAACATATTATCTTATCTCATCTGAACTGTGTAACCAAACGAGACCTTGCAAATGCTATCCACACCGGCTTGCGAATAAAAGTACTCAAAAAGTATTGGTGATTAATTCGCCTTTTTTTATTTTTAAATAACTGGAATAGATCAGATTGTCTCATAGTAAATTAATAACGTATTTAGTTATAACAGAAAATGTTTCTTTTATAATTTTCAAGGTCTATGGCTAGCAAGTTAAAGTTAGTTAAGATAGAAAGGTCTTGCATGATGTATTACCATGATCCATCAGGTTCTTGTATGTCTTGAATGAATTGAATGGCCTTGGAAATGGAATTGTCTATCTCCATCCGACGGTGCTTGGGATACAATTTCCTAAAGAGTACGAGACCTTCAACTGCTGACGCAGTGCACTCCACGTACCTATAATTAATTTTTTCATCATGTAATATATATGACCAATAATGTACGTGGGAATTGCATGCTAGCAGTTGTACATGAGACGTGATCTCATCTTACTCTTTTTCAAAGAGACAATCCTCGTAGACCTCAGTTGGGTTGAACTTCTGCAGCAAGATATACAGAATTTATGTCAAGGCCGATTACTGCTGAGTTAGCATAAAAATGCATGCATGGTTTATTCTCATTAAGAATGAACAGATCACCTACCTGCATCCAGGGGGATGCTTTCACAGGCTCCCATGCTGAGAAACCACCATTACTATTCTGTAGTGGAAGTTTAAAGAACGTCATGAGTTACTGATCAGGCAATGCATGCATTAGAAAAATGAACATAATGAAGTTAATCATGTTGATGAATAATTCTAAGTTACTTGTAGATAAAGTCTTGGGTATGTTTATAAGAAATGTACAATTTTTTCTTGTAGAACTGGTTTTATGAGATAGTTGGCCATAAATTTCTTCAAATCCCGTACGTGGAGAATTATGCATTTGCAAGCTGATGCGGAGAACGTATTCTTTACGCCACTGATCATACGATATGATTTGATTTTTTTATTTTTTATCATGTTATAGCGTAGAAGGTGCGTTCTTCACACCAGCTTGTAATTTTCCTATACGTTAACGGAGTGAAAAACAAAGTATGTGATCAGTTGAAGTACTACTTGTAGAGAAAGAATGACATTCACTGCATCATAAAACCGTTCAGTCTCCATTTTTTCCCCAACTAGATCGGTGGGCAATTGTGATAACATGAATGTAGCCTGCAGAAGAGAAATTCGTTTTGCAAAAATCAAATTCTTCGATAATGTGCTTGGAATTTTGTATACATGTACGTATTTTGAGGTCAGAATTCAGGCATTTTATATACCTTCAACCCTTCTGCTGTGACATCAGAGACTTGCCAGCCATAGTCCTGTGTTGCTAGTGTCCAGGATCCTTTAGTTATGTGTCGGTACATAGCCTTGAAGTCCCCGGGAGGGTCTTCTTGCACCTACGTATAATAAGAATAAGAATAATATTGATACATTATATATTCTTATTAAAACCAACTCGTCAAACAATATTTAATAAAATAAGACGAAATATAGAGTCAAAGTTTCGAGCTCACTAAATCATTAATTATCTTGAGCTCACTAAATCACAACAGGAGATAAGTATCTTGTTGTAACACGATTTTCCTCCGTCATAAGTGAGGACAGTCAATAAGATTTGGGTACCGTCTTTTAAAAAAGAAAAAGAAAAAAAGAGAGATAATACTGTGATCATCACCTGTGAAGCTTTCATGAAATCATATGCTTTTCGAAGAGTTGGCGCGCACTCTTCGTTTAGATTACAATGTAGGATTGCTTGAATAGCGAAAACTGCAGACCACGTTTGACAGCCCAAACTCTGCACCACAGGAATAATGTTGGATCAAATTCTGAGTTATTTAGTCTTCATGACTAGTGAGTAATTCTTCAAGTATTTGTCATTAATGCATGAGCATGATGGCTCACTTTTTCATAGATCATGTATGTACTTATGCATGCTTGCTTTTGTTCTTGTTCCACAAAAGATCAGGCTAGGGAAGATAATATCACCTGAAGTTTTAAGCCATCTTCTGCAACCCAATAGTAATCAGGAAGTCTGGCTAAATGACACTTGTAAGCCTCTGAATCTGGATCTTCAACCCACTGGGCCATCAAGCATAACACCTTTCATAAAAATAAATAAATAAGAGTTAAGTTCTTAATGTACTGGATCTTCTTAATTCATTGAGATTGAATTGCATGCAGAAGCTAGATAGCACTAGTTATATATATGGACCTTTTCAACGCCTCCAATGCATAAATATCTGCTGGCCTCGTCCTCATAACGTATATGATCAATGGCAATTTTCACCGCCTTCTCTCTCACCATTGAAAAGGGCCAAACTGACAGGAAAGGCTCCACTACGTATTGAAGAAAATCCCATGCCAAATCTTGTACCAGAGGATGTGGAAAGTAGAGATCCTCCTATATATATATATATTAATTAGAACTTTCGATTAGTTCTTATGTAATAAAAAAATTTAAGAGAAATGCATTTTTTGAATGAAAATGATATTTTTGGGGCGAAAAATTTTGTAAAAAAAATAGAACTTTTATATATGTTGTAGTGTTATACATATTTTTGCCAAAATGTTGTGAAATATAATCATGATAAACAGTTTGGAGACAGAAAAGAAAGACCTAAAAGGACGGAGATTCATATATATATATATATATATATATTTAGTACTAACCTTTGCAATTGTGTTCCTGGCTTTGTTCCAGTTAACTTGTTCATAAGGCTCGTTGTACAACTCTTGCCTTAGCGATTTGACCAACTCAGTGATTGGACCAACAAATCTCTTCCCATATAAATAAGACATTGGCATGTAAACTAAGCGAGCATAGCATAACATTTTTCCTGCAATAATATTAATTCATATAATTAAGGCCATCTAATTATTAGTGTTCATCTTAATGTAATTTTTCAAAGCAAAAAAGAAAAAAGAAAGAAAGAAATATGTATATACAAGTAGTGAGAAAATCAACCTGGATTAAGGGGGAAGAAATCAGGAAGCAGCCAGAACTCTGGGGGTAATGGATTACATCCCGACCACTCATACGCTCCTAGTACCTATACATATATGATTAGAAGATGCATTATTTCAGATTGAGAAATACTTTAGCCACAAATGGTTTACACAAAAGTAATCTCATAAACTAACATAGTTTCTTGTGATTTGTCAGATTGTAAAGTTATTTTTATTATAAAATAGATCTAACGGATTATATGAAAGCAAATAATAATAATAAAAATTCTATATGGTGAAAAACAAATTCTATAGAATCCTTTAAAATCTGATCGACCCATAAAATTATATATAGGAACTCTTAACAATTGTTCATACCGAGACCCAAAACTTTCCCCATGATGGCATTGCCACCAAACCACCATGGTCGAGGATCCATTTTCGGCCTCTATCCATGGCCCTATCTTCACCATCTTCGAGCCCCTCTCCAAGTATCCTCAAGGCAATATAGCTCAAAGCTGAGCCAAACATTGTGCTGTCTCCCACTATGTGAAAACTCCATCCTCCATCTTCATTCTGTCCAACAAACACAGTAAAATACACACACACACACACATGAATATATATATATATATTTAATATCTACCCAAAATATATCTCAGCTTCCTTGAACTTTTATTGAATCCAAACCTGAGTATTATATAGGTATCGAATGATTTCCTTCCGATGATGTGATGAGAACATGCGATCGAGATCCCCAGTAATAGACAATGCCATCACCTGCAAATGGTAGTAACTTCTATTTAAGTTTTCTACAGCAAATAACTTACCAGGATGATTTATTTATAAGCCATTTAAATAAAACAAAACATAATCATTATAAAATAATACTCTTATAACTCTTTTATTTGAATAAAAAATGCCATACCAAGGGCCCAACAAAAACCAAGGGTCCACCAAATTCAGCAGGCCAGTGGCCATCATGGGCCTGAAGGGAGGAAGTGGAGCTTAGTGCTCTTCTCAGTGTAGTTGTCACTGCTTCCTCTGTTATTTCCTCTGTTTCTTGGACTTTCACTGGTGGTGGAATTTGCCCACGTTGATTCTCCTTTCTGATCTGCAAGTTCAAAAGCCAAAAATATAATTGGAATTTCACAACTGATCAAAATTATTCTAATCATGCAATATCATTAAGGGTTAGAAAACCATCTATCATGACTATATATATGAATAACGGGATAATTTATAGGATAATTATTATTTTTTAAAATCACTAAAATTTTCAATATTCTACCTCAAATTTAAAATTTACCTATTCATTGAGATTTGGTCATTAACATAAATAAAAAAATAAGTACTACAGATACAAAGAAATTATACAAAATTATCAAATCCACAAACTGATGTGGTTTTATTTGATCCGTTAGATGTATTTTATAATAAAAATAACTTTACAATCTGACGTATAACATCAAGCCATATCAGTTTGTAAGTTTATTTTTATGTAATTTTTTTATGGCTAAACTATTTCTCTTGTAATTATTTTTTATTATTATTCATGGACAAAAAATGAAGTGACGTTATAGGTAATAAAATATTTTCAATTTTTTATTCACTTATGTGATTCCAGCTTCGAGGGGAATAAAGATGATTATCCTAGGAAATATAAATTGCCAGAAAAGAAGGGAGGGGTGGGAGGAGATGAGATGATCTTTGCATGTGAGGTCGTTAGTGCATGGGAAAATGTGGGTGTAGGAAAACATGGGTGTGCATTTGATCCAACCTTGAGTATGAGTGACTTGGAAGTCTGTTCTTGCATGCTAATTCTATACTCTGTTTCAAGGCAGAAATACTCATCCAGATCACCCTTTCCTCTCTATCGTAGTTTGCGGACTTGTGTATGCATGGGGACAAGGGACATGATGAGATATGATATTTGCAATCGTGGAATACTGCATAAGTGTAGTACAAACTTTTTGAAAAAGTAAGTAAATATGAGATTTACATAAAAAAATTAATTTTTTAATAATAGAAACGTTGAGTACCACACTCTTTTTTAAAATGATTACATGGTGCTTATGCACTACACGACTTCACATAGCATTACTCTTTTTTTAAAAATATTTATTAAAAAAAAACTTGAGCAAATCTTGCAACCATATTGAGGTACGAGTTTTTCACTCCGAAACTTTTTATCGTACTATATATCATGATGGTCTCATGATATATATGTATGTCTCGTCCACCATCATGATATATATGTATGTCTCGTCCACCATCATGATATATTGTATGTCTCGTCCCTAAGTCAAACGGAGTCGATCACATTTTGGCTATAATACCAATATTGCATCGAAGGCTATAAGGTTTCTCTCGAGGGTACGTATATACTTTGGCTTCTCTTTTTCCTTTAATATATGTGCATGGGTTGGTCGAATGCAATCACCTGTCCAATTTGATATATATACATGCATAGCGAAATACATTTGGTGTAGTAAATGCATGTGCACTTCAAATATATAGTATCTCTAGTACTCTTATATCATGCATGGTATTCAAGAAATTACTACTATAAGAGATTATCAGATCAGAAAAAGGAATAAAGGGAAAAAAGATCAAGAGAAGAATTAACCTGCATTCTCATCAAAAGATCACAACTTTGTTTCATCTTAAACCGATTTTTCTTGTATTCCTCACGGACCCTTTCAACTTCAGCATGTTCTTCAGGTGTACCGGCATTAGGGTCGAATTCCCAGTGTTCTCGCCCGATGAAATTGTTTACGCTCACCAAATCAGGGCCTCCTTGGGATACTTTCAACTTCCACATCCTGATGATCTATATATATATAAGTACTACAGATCATAACGTTCCTATATTCAGACAAGATCATCACTGATATATATAACCCTAAACAACTAGGAATTAGCATTTCATAAAAAGATGAGGGAATATATAGCTAACTAGGAATCTTCTCAAGAAAGGTTTAAGATAGTTATATGTCAAAAGTATTAATCATAATTAAATTACGTTACTTAAATTATTATAAAAAATTAAAAATTATCAATTGCAAGCCATGTGTAATTAAGCTCATGATAATCACTAGCTAGCTAGCTACGTACCCTCGTATGGAAGGAAGCTTGAACCTGAAATTAACTAGCATTAGCTAGGCACCTTGTTTTAGCGAAATTTTGAAAAACAGAAAAAGATTACACACGATGAATTAATTAATTATATATATATATATAAATATATATTATATTTGAATTTAGAAATTTATACGGCAAAACTAATTTTAAAAAATAGATTTAAAAAAAAGATTTAGGGTAAGATTGTAATTTTAATAGGAACTTGATCAATATATATGATTTCTTCTACAAAATATGACTATTATAGACTTAATTAGGGTTGTTAGGGGGGTTATAATATTCATTTTTTTTTCGGCAAAAGGGAACAAAAGCCTAAAAAAATAAAAAATTGAGGAGCCTTGGCCCCCCAACCGTCCCTGATCTTTACATGAGTGCAAGGAGCATGGCTTGTTGGCTGGAATGAACGAACTATACATGCATGCATATGAGATGGATTGGATGGAAATGATCATCATGATTTGCATATTTTGATCCCTCATCAACATCGATCATACTCTTTAGTCGAAAGTATCATCAGCAAATATTATTATATACATATATATATATATATAATGCAAATTATATAATTAAGAAAGATCAGAACAATCAAATTAATTTGTTGCCATGAAAAGTTGAGATCGAGCGAGAAATGAATGGAAAGAAGCAGAGAGATATTTCTTGAATACCTTTAAATTAGCTTAGAGATCAGCAAATTTATAAGAGGCCGGTCGAGGAAGAACACATTCGCTAGCTACTTCTTGAGATGCATGCAATAATATTAGCACAAGTAGTACTGCTAGCACTTGGCCTTTGGTAGAGGCTTATCCCCTTT
…------------------ 原始邮件 ------------------
发件人: "rmhubley/RepeatMasker" ***@***.***>;
发送时间: 2023年7月22日(星期六) 凌晨2:14
***@***.***>;
***@***.******@***.***>;
主题: Re: [rmhubley/RepeatMasker] The relationship between the location of Mutator transposons and exons (Issue #225)
Could you provide an example with the alignment data?
—
Reply to this email directly, view it on GitHub, or unsubscribe.
You are receiving this because you authored the thread.Message ID: ***@***.***>
|
The file contains a pair of homologous genes and the sequence A is ancestral. We performed de novo library building with RepeatModeler for A and B, respectively, and then merged them. However, the annotation results show that only the exon on gene B is annotated to Mutator, and these exons are extremely similar to the ancestor with a very high degree of identity. Then why Mutator was not annotated on the ancestral sequence?
I also asked Prof. Damon Lisch about this question, and he believes that Mutator is not seen in these two sequences, so I would like to know more about what is the basis for RepeatMasker to recognize Mutator?
>A
TTATGCAAGTGGAACTCGCCTTCGATATTCCCCAAGAGCCCATAAGGGGAAAATGTCTCTAAATGCTGCGTAGTTTAATCCACAAAACGTAAAAAATATCCCAGTGATTTCCTGCACGTTTTGGCAATAAACAAACATTGGGACTTTAGAAATTGTTTTTATTGAAGGAAAAGGTAAAAAAAAAAAAACGAAAAAAAAAAACACTACTCAAGACAAACACTTGAATGAACTATTATATAAATAAATGTGCTTGATTTTGTATAAATTGCATGAATTACCTGTTGAGGAAATTCACCATCTTCCAATTGTGAGTTGATCAGCACCCTTACTCCACGATGAATTGGTGTTGGATCTCTCTCAGCCTGATTAATTTTGAGAGAAAAAAAAAAAAACTGTGATTTTTAAATCTCTTAAGAGAAAAATGTTATTTATCATCATTTTAACCATAATTCTTTTAATATTTTTTGAGGTGACATCAAATGATTGGTAGATACAGGTGAAATATAACAAATATTTTTTCAATCATTTAAAGCGATGTTATCGGATGATGAGATCAGAATGATAAATAGAAACGATGACGGGTAACATTACAGCTTATTAAAAAATGACCACAGACTGACTGACCTGTCCTGCGTTAATGAGCGTTAACACTGCCCAAGCAGTTTGGACAACATTTTCTCGGTTGCCTTCAATATTTGACCACACCTGTTCAACAATCAAACTTTATTTCTCTATTTTCATTTTCCTTTTTTTTTTTTTGTTTTTCCTTGTGGTTTACACAATAAAAAATGGAATATGTTGTCAAATATATATATATATATACCTTGTTGTGGCATGAAAGGTAACTCTCTCCCCATCCACCATTTGGCAACTGCTTCGACAGCAAAAAATCACAAGCTTTGCGAATTGCAGCACAATTTTGGTAGTTTCTTCCAGAGGCTGTTAGTGACCCTACAGCAAACCATGTGCCATAGGTGTAGCAAATCCCCCAATTACCATACCTGTGTAAGTGCTAATCTTGCATAAATATCGATATTGTTGCCTTGTTGCAAGGATACTAACGTGTGAAGAAAATTCTAAAAAAAAAAAGAAAGAAATTGAAAAATTGTATTTGCGAGCCGATATGAAAAACGCGCTGATCTCCAGACTAATGACATGACAAAACTTGATTTGCAAGATAAACTTAAAAACTATATATATCTCTTACAAATCAAAATCTCTCTTATTAGCAATGTAGAGACTGTAACGCTAACCACACCGGCTTGTGAATAAAAGTACTAAAAAGGTATTGGTGATTAAATTTAGGTTTGGGGAGGATCATGAGATGAAAATTTTTAATTTTAGATAAAAGTTTAAAATATTATTTTTTAATATTAATTATTGTTTTGAGATTTGAAAAAGTTGAATTGAGATTTGAAAAAGTTGAATTGTTTATTATATTTTGTATGAGAATTTGAAAAAATTGTAATGATGAGATGAAATAAGATGAGAATTTTGTGTTTTATATTTAGTACCAAACCTAACCTAATTCGCCTTTTTTTTAATAACTGGAATAGATCAAATTATCTCATAGTAAATTAATAACATATTTAGTTATAACAGAAAATGTTTCTTTCATGGGTTATAATATTTTTTTTGGCGAGTTTTATTCCATGAGTTTCCCACGACAAAATTTTGTAGAAATAGAGTTTTTTCCACAAAAGTCTAATTTTTGCCACAAAACTCATTCATGGAAAATTGCAGTTTCAGTTGTAGTGAACTAGCAAGTTAAAGTTAGTTAAGATAGAAAGATGATCTTGCATGGTGTCTTACCATGATCCATCAGGTTCTTGTATGTCTTGAATGAATTGAATGGCCTTGGAAATGGAATTGTCTATCTCCATCCGACGGTGCTTGGGATACAATTTCCTAAAGAGTACGAGACCTTCAACTGCTGACGCAGTGCACTCAACGTACCTATAATTAATTTTTTCATCATGTAATATATATGACCAATAATGTACGTGGGAATTGCATGCTAGCAGTTGTACATGAGACGTGATCTCATCTTACTCTTTTTCAAAGAGACAATCCTCGTAGACCTCAGTTGGGTTGAACTTCTGCAGAAAGATAAACAGAATTTATGTCAAGGCCGAGTACTGTTGAGTTAGCATAAAAATGCATGGTTTATTCTCATTAAGAATGAACAGATCACCTACCTGCATCCAGGGTGATGCTTTCACAGGCTCCCATGCTGAGAAACCACCATTACTATTCTGTAGTGGAAGTTGAAAGAACGTACGTGAGTTACTGATGAGGCAATGCATGCATAAGAAAAATGAACATAATGAAGTACATTTGCAAGCTGATGCGATGAACGTATTCTTTATGCGCCACCGATCATACGATATGATTTGATTTTTTTTATCATGTTATATATCGTAGAAGGTGTGTTCTTCACACCAGCTTGTAATTTTCCTATACGTTAACGGAGTGAAAAACAAAGTATGTGAGTTGAAGTCCTACTTGTAGAGAAAGAACGACATTCACTGCATCATAAAACCGTTCAGTCTCCATTTTTTCCCCAACTAGATCGGTGGGAAATTGTGATAACATGAGTGTAGCCTGCAGAAGAGAAATTGGTTTTGCAAAAAGTCAAATTCTTCGGATAATGTGCGTGGAATTTAGTATATATGTACGTATTTTGAGATCAGAATTCAGGCATTATATATACCTTCAACCCTTCTGCTGTGACATCAGAGACTTGCCAGCCATAGTCCTGTGTTGCTAGTGTCCATGATCCTTTAGTTATGTGTCGGTACATAGCCTTGAAGTCCCCAGGAGGGTTTTCTTGCACCTACATATAAGAATATTATTGATGCATTATTCTTATTAAAACCAACTCGTCAAACAATATTTAATAAAATAAGATGAAATATAGAGTCAAAGTTCTCTCGAACTCACTAAATTAGAGATAGGTATACTGTTGTAATTATGTTTTTCCTCCGTCATAGAACAGTCAATAAAATTTGGAGTGCCGTTCTCTTTAAAAAAAAAAAAAAAAGAAAAAAAAAGAGAGAGAGGAGATATTCTGATCATCACCTGTGAAGCCTTCATGAAATCATTTGCTTTTCGAAGAGTTGTCGCGCTCTCTTCGTTTAGATTACAAGGTAGGATTGCTTGAATAGCGAAAACTGCAGACCACGTTTGACAGCCCAAACTCTGCACCACAGGAATAATGTTGGATCAAATTCTGAGTAATTTAGTCTTCATGACTAGTGAGTAATTCTTCAAGTATTTGTGATTGCATGATCCGCACGATGGCTCACTTATTCATAGATCATGTATATACTTATGCATGCATGCTTTTGTTCTTGTTCCACAAAAGATCAGGCTAGGGAAGATAATATCACCTGAAGTTTTAAGCCATCTTCTGCAACCCAATAGTAGTCAGGAAGTCTGGCTAAATGACACTTGTATGCCTCTGAATCTGGATCTTCAACCCACTGGGCCATCAAGCATAACACCTTTCATAAAAATAAATTATAAATAAGAGTTGAGTTCTTAATGTGTATCTTCTTCATTGAGATAATTGAATTGCAGCAGCTAGATAGCACTAGTTATATATATGGACCTTTTCAACGCCTCCAATGCATAAATATCTGCTGGCCTCGTCCTCATAACGTATATGATCAAAGGCAATTTTCACCGCCTTCTTTCTCACCATTGAAAAGGGCCAAACTGACAGAAAAGGCTCCACTATGTATTGAAGAAAATCCCATGCCAAATCTTGTACCAGAGGATGTGGAAAGTAGAGATCCTCCTATATATATATATATATATATATATATATTAATTAGAATTTTCGATTAGTTCTTATTTAATAAAAAATTTTAAGAGAAATGCATTTTTTGAATGAATTTTTTTATCCCAAAAACTTCGAGAAGAAAATGATATTTTTGGGGTGAAAAATTTTGTAAAAAAAATGGAACTTTTATATATGTTGTAGTGTTGTACATATTTTTGCCAAAATGTTTTGAATATAATCATGATAAACAGTTTTGAGACAGAAAAGAAAGACCTAAAAGGACGGAGATTCATATATATATATATAGTACTAACCTTTGCAATTGTATTCCTGGCTTTGTTCCAGTTAACTTGTTCATAAGGCTCGTTGTACAACTCTTGTCTTAGCGATTTAACCAACTCAGTGATTGGACCAACAAATCTCTTCCCATATAAATAAGACATTGGCATGTAAACTAAGCGAGCATAGCATAACATTTTTCCTGCAATAATATTAATTCATATAATTAGGGCCATCTAATTATTAGTGTTCATGATCTTAATGTAATTTTTAAAAGCAAAAAAGAAAAAAGAAAGATATATATATATATATACACACACACAAGTGAGAAAATCAACCTGGATTAAGGGGGATGAAATTAGGAAGAAGCCAGAACTCTGGGGGTAATGGATTACATCCCGACCACTCATACGCTCCTAGTACCTATATATATATATATATATGATTAGAAGAGGCATTATTTCAGATTGAGAAATATTTTAGCTACAAATAGATTACACAAAAGTAATATCATAAACTGACATAGTTTCATGTAATTCGTTAGGTTGTAAAATTATTTTTATTATAAAATAGATTTAACATAGGGTGAAAAACAAATTCTATAGAATCCTTTAAAATCTAATCGACCCATAAAATTATATATAGGAACTCTTAACAATTGTTCATACCGAGACCCAAAACTTTCCCCATGATGGCATTGTCACCAAACCACCATGGTCGAGGATCCATTTTCGGCCTCTATCCATGGCCCTATCTTCACCATCTTCGAGCCCCTCTCCAAGTATCCTCAAGGCAATATAGCTCAAAGCTGAGCCAAACATTGTGCTGTCTCCCACTATGTGGAAACTCCATCCTCCATCTTCATTCTGTCCAACAAACACAGTCAAATACGTGTATATATATATATATATATGTTTAATATCTACCCAAAATATATCTCAGCTTCCTAGGCCTTTTATTGAATCCAAACCTGAGTATTATATAGGTATCGAATGATTTCCTTCCGATGATGTGATGAGAACATGCGATTGAGATCCCCAGTAATAGACAATGCCATCACCTGCAAATGGTAATGACTTCAAATCAAGTTATCTATAGCAAATAACTTACCAGGATAATTTAAATATAACAAAACATATATAATCATTATAAAATAATATTCTTACAACTATTTTATTTGAATAAAAAAATTAAAATGATGCCATACCAAGGGCCCAACAAAAACCAAGGGTCCACCAAATTCTGCAGGCCAGTGGCCATCATGGGCCTGAAGGGAGGAAATGGAGCTTAGTGCTCTTCTCAGTGTAGTTGTCACTGCTTCCTCTGTTATTTCCTCTGTTTCTTGGACTTTCACTGGTGGTGGAATTGGCCCACGTTGATTCTCCTTTCTAATCTGCAAGTTTCAAAAGCCAAAAATATAGTTGGAATTTCATAACTGATCAAAATTATTCTAATCATGCAATATCATTAGGGGTTAGAAAACCAACTATATATGAGTATATGTATGAAGAAAGGGATAATTATTATTTTTTAAAATCACTCAAATTTTCAATATTCTTCCTCAAATTTAAAATTTACGGCCTATTCATTGAGATTTGGTTATTAACATAAACAAAAAAATTAGTGTTACAAATAAAAAGAGATTATACAAAATTATCAAATCCACAAATTGACATAGTTTTATTTGATCCGTTAGATATATTTTATAATAAAAATCATTTTACAATCTGACGTACCGCATCAAATCACATCAATTTATAAATTTATTTTTACGTAATCTAAACTATTTCTCTTCTAATTGTTTTTTTTATTATTATTCATGGACAAAAAATGAAGTGACGTTATAGGTAATAAAATATTTTCAATTTTTTTTTTCACTTATTTGATTCCAGCTTGGAGGGGAATAAAGATGATTATCCTCGGAAATATAAATTGCCAGAAAAGAAGGGAGGGGTGCGAGGAGATGATCTTTGCATGTGAAGTCGTTAGGGCATGGGAAAACGTGAGTGTGGGAAGACGTGATCCAACCTTGAGTATGAGTGAGTTGGAAGTCTGTTCTTGCATGCTAATTCAATACTCTATTTCAAGCCATAAATACCCATCCAGATCACCTTTTTCTCTCTATCGTAGTTTGCGGACTTGCGTATGCATGGGACGAGGGAGATCATGAGATATGATAATTTTTTTTTTTGAGTTGCATGAGATATGATATTTGGTGTAAGTGTCGGCAAATACAGATCCACATAAAAAAAAAATAACTTTTTAATAGTTAAGATTTGGAGTGTCCCACTCTTTTTCAAAATAATTATGCAACTTTTATGTATTATATATACGACTTCACGTAGCCTTAGATTATGTTTGGAAGTTTCATCTTCAATAAAATTCTCATCTCATCTCATCTCATCATTACAACATTTTCAAATTCCTATATAAAATATAATAAATAATTCAAATTTTTCAAATCCCAATACAACTTTTTCAAATTTCAATTTAACTTTTTCAAATCTCAAAACTAAAAAATAATATTTTAAACTTTAAAACAAAACACAAAATTCTCATCTTACCTTCCAAACATAATCTTATTCTTTTTTAAAAATATTTATATAAAAAAAAAAAACACAGAAATCACTTCGTCGGTGCACGTAGCACCGTACGCATACCCTTTTCGTAACTATGAGCAAATCTTGCAACAATATTGAGGTTTGAGTTTTTCAGTCCGAAACTTTTTATCGTACTATAGATATCATGATGGTCTCATGATATGTATGTCTCGTCCCTAAGTCAAACGGAGTCGATCGCATTTTGGCTATAATACCATTGCATCGAATTAAGGCTATAAGGTTTCTCTCGAGGTTATAATTTTGGCTTCTCTTTTTCCTTTAATTAATTTATATGCATGGGTTGGTCGAATGCAATCACTTATCCAATTTACTATACATGCACACCGAAATACATTTGGTGCTGTAAATGCATGTGTGCATGCACTTCAAATATATAGTATCTCTAGTACTACTCTTATATCATGCATGCAGGTATTCAAGAAACTACTATATATAAGAGATGAGATCAGAAAAAGGAATTAAGGGAAAAAAGATCAAGAGAAAAACCTGCATTCTCATCAAAAGATCACAACTTTGTTTCATCTTAAACCGATTTTTCTTGTATTCCTCACGGACCCTTTCAACTTCAGCATGTTCTTCCGGTGTACCAGCATTAGGGTCGAATTCCCAGTGTTCTCGGCCGATGAAATTGTTTACGCTCACCAAATCGGGGCCTCCTTGGGACACTTTCAACTTCCACAT
>B
TAAATATATATATATATATATTTATATTGGAGACTTCAGTGCTGCTATCTGTCGAATTACTTCAATATGCAGAACTTTTGTTTTGCATTTCATGCACTATTATGCAAGTAGAACTCGCCTTCGATATTCCCCAAGAGCCCATAAGGGGAAAATGTCTCTAAATGCTGCGTAGTTTAAGCCACAAAACGTAAAAAATATCCCAGTGATTTCCTGCACGTTGGCAATAAACAAACATTGGGACTTTAGAAAATTGTTTTTATTGAAGGAAAAGGAAAAAGGAAAAAAAAGAAAAAGAAAAAGAAAATAAAACTCTACTCAAGACAAACACTTGAATGAAGTATTAAATAAATGTGCTTGATTTTGTATAAATTGCATGAATTACCTGTTGAGGGAATTCACCATCTTCCAATTGTGAGTTGATCAGTACCCTTACTCCACGATGAATTGGTGTTGGATCTCTCTCAGCCTGATTTTGAGAAAAAAAAAAAAACTGTGATTTTTAAATCTCTTAAGAGAAAAATGTTATTTATCATCATTTTAACTATCATTTTTTTAATATTTTTGGACGTGACATCAAATGATTGGTAGACCGGTAAAATATAACAAATACTTTTTCAATCATCTAAAGCGATCGACGTTATCGGATAATGAGATCAGAATGATAAATAGAAACGATGACGAGTAACATTACATCTTATTAGAAAATGACCATAGACTGACCTGTCCTGCGTTAACGAGTGTTAACACTGCCCAAGCAGTCTGGACAACATTTTCTCGGTTGCCTTCAATATTTGACCACACCTGTTCAACAATCAAACTTTATTTCTCTATTTTCATTTCCTTATTTTTGTTTTGTTTTTCCTTGTGGTTTACACATTAAAAAATGGAACAAGACGTTGTCCTATATATATATATATATGTATATAAATAATATATACCTTGTTGTGGCATGAAAGGTAACTCTCTCCCCATCCACCATTTGGCAATTGCTTTGACAGCAAAAAATCACAAGCTTTGCGAATTGCAGCACAATTTTGGTAGTTTTTTCCAGAGGCTGTTAGTGCCCCTACAGCAAACCATGTGCCATAGGTGTAGCAAATCCCCCAATTACCATACCTGTGTAAGTGCTAATCTTGCATAAATATCCATATTGTTTCCTTGTTGCAAGGATACTAACATGTGAAAAAAGTTATAAATATCGATATGAAAAACACGCTGATCTCCAGACTAATCATGACAATATGACATGAAGACTTGATTTGCAAGATAAATTTAAAAACTAGGGACTGGTTTGGTTACACAAAACTAAATCATTTTATTTCATAAAATCATTATAAAATTTTCAAACTCCCATATAAAATATAATAAAAAATTCAAAATTTTCAGATTTCAAAATAAAAATAATATTAAAAAATTTATATTATAATAATATTCTATTCAACTTTTAACAAAACATATTATCTTATCTCATCTGAACTGTGTAACCAAACGAGACCTTGCAAATGCTATCCACACCGGCTTGCGAATAAAAGTACTCAAAAAGTATTGGTGATTAATTCGCCTTTTTTTATTTTTAAATAACTGGAATAGATCAGATTGTCTCATAGTAAATTAATAACGTATTTAGTTATAACAGAAAATGTTTCTTTTATAATTTTCAAGGTCTATGGCTAGCAAGTTAAAGTTAGTTAAGATAGAAAGGTCTTGCATGATGTATTACCATGATCCATCAGGTTCTTGTATGTCTTGAATGAATTGAATGGCCTTGGAAATGGAATTGTCTATCTCCATCCGACGGTGCTTGGGATACAATTTCCTAAAGAGTACGAGACCTTCAACTGCTGACGCAGTGCACTCCACGTACCTATAATTAATTTTTTCATCATGTAATATATATGACCAATAATGTACGTGGGAATTGCATGCTAGCAGTTGTACATGAGACGTGATCTCATCTTACTCTTTTTCAAAGAGACAATCCTCGTAGACCTCAGTTGGGTTGAACTTCTGCAGCAAGATATACAGAATTTATGTCAAGGCCGATTACTGCTGAGTTAGCATAAAAATGCATGCATGGTTTATTCTCATTAAGAATGAACAGATCACCTACCTGCATCCAGGGGGATGCTTTCACAGGCTCCCATGCTGAGAAACCACCATTACTATTCTGTAGTGGAAGTTTAAAGAACGTCATGAGTTACTGATCAGGCAATGCATGCATTAGAAAAATGAACATAATGAAGTTAATCATGTTGATGAATAATTCTAAGTTACTTGTAGATAAAGTCTTGGGTATGTTTATAAGAAATGTACAATTTTTTCTTGTAGAACTGGTTTTATGAGATAGTTGGCCATAAATTTCTTCAAATCCCGTACGTGGAGAATTATGCATTTGCAAGCTGATGCGGAGAACGTATTCTTTACGCCACTGATCATACGATATGATTTGATTTTTTTATTTTTTATCATGTTATAGCGTAGAAGGTGCGTTCTTCACACCAGCTTGTAATTTTCCTATACGTTAACGGAGTGAAAAACAAAGTATGTGATCAGTTGAAGTACTACTTGTAGAGAAAGAATGACATTCACTGCATCATAAAACCGTTCAGTCTCCATTTTTTCCCCAACTAGATCGGTGGGCAATTGTGATAACATGAATGTAGCCTGCAGAAGAGAAATTCGTTTTGCAAAAATCAAATTCTTCGATAATGTGCTTGGAATTTTGTATACATGTACGTATTTTGAGGTCAGAATTCAGGCATTTTATATACCTTCAACCCTTCTGCTGTGACATCAGAGACTTGCCAGCCATAGTCCTGTGTTGCTAGTGTCCAGGATCCTTTAGTTATGTGTCGGTACATAGCCTTGAAGTCCCCGGGAGGGTCTTCTTGCACCTACGTATAATAAGAATAAGAATAATATTGATACATTATATATTCTTATTAAAACCAACTCGTCAAACAATATTTAATAAAATAAGACGAAATATAGAGTCAAAGTTTCGAGCTCACTAAATCATTAATTATCTTGAGCTCACTAAATCACAACAGGAGATAAGTATCTTGTTGTAACACGATTTTCCTCCGTCATAAGTGAGGACAGTCAATAAGATTTGGGTACCGTCTTTTAAAAAAGAAAAAGAAAAAAAGAGAGATAATACTGTGATCATCACCTGTGAAGCTTTCATGAAATCATATGCTTTTCGAAGAGTTGGCGCGCACTCTTCGTTTAGATTACAATGTAGGATTGCTTGAATAGCGAAAACTGCAGACCACGTTTGACAGCCCAAACTCTGCACCACAGGAATAATGTTGGATCAAATTCTGAGTTATTTAGTCTTCATGACTAGTGAGTAATTCTTCAAGTATTTGTCATTAATGCATGAGCATGATGGCTCACTTTTTCATAGATCATGTATGTACTTATGCATGCTTGCTTTTGTTCTTGTTCCACAAAAGATCAGGCTAGGGAAGATAATATCACCTGAAGTTTTAAGCCATCTTCTGCAACCCAATAGTAATCAGGAAGTCTGGCTAAATGACACTTGTAAGCCTCTGAATCTGGATCTTCAACCCACTGGGCCATCAAGCATAACACCTTTCATAAAAATAAATAAATAAGAGTTAAGTTCTTAATGTACTGGATCTTCTTAATTCATTGAGATTGAATTGCATGCAGAAGCTAGATAGCACTAGTTATATATATGGACCTTTTCAACGCCTCCAATGCATAAATATCTGCTGGCCTCGTCCTCATAACGTATATGATCAATGGCAATTTTCACCGCCTTCTCTCTCACCATTGAAAAGGGCCAAACTGACAGGAAAGGCTCCACTACGTATTGAAGAAAATCCCATGCCAAATCTTGTACCAGAGGATGTGGAAAGTAGAGATCCTCCTATATATATATATATTAATTAGAACTTTCGATTAGTTCTTATGTAATAAAAAAATTTAAGAGAAATGCATTTTTTGAATGAAAATGATATTTTTGGGGCGAAAAATTTTGTAAAAAAAATAGAACTTTTATATATGTTGTAGTGTTATACATATTTTTGCCAAAATGTTGTGAAATATAATCATGATAAACAGTTTGGAGACAGAAAAGAAAGACCTAAAAGGACGGAGATTCATATATATATATATATATATATATTTAGTACTAACCTTTGCAATTGTGTTCCTGGCTTTGTTCCAGTTAACTTGTTCATAAGGCTCGTTGTACAACTCTTGCCTTAGCGATTTGACCAACTCAGTGATTGGACCAACAAATCTCTTCCCATATAAATAAGACATTGGCATGTAAACTAAGCGAGCATAGCATAACATTTTTCCTGCAATAATATTAATTCATATAATTAAGGCCATCTAATTATTAGTGTTCATCTTAATGTAATTTTTCAAAGCAAAAAAGAAAAAAGAAAGAAAGAAATATGTATATACAAGTAGTGAGAAAATCAACCTGGATTAAGGGGGAAGAAATCAGGAAGCAGCCAGAACTCTGGGGGTAATGGATTACATCCCGACCACTCATACGCTCCTAGTACCTATACATATATGATTAGAAGATGCATTATTTCAGATTGAGAAATACTTTAGCCACAAATGGTTTACACAAAAGTAATCTCATAAACTAACATAGTTTCTTGTGATTTGTCAGATTGTAAAGTTATTTTTATTATAAAATAGATCTAACGGATTATATGAAAGCAAATAATAATAATAAAAATTCTATATGGTGAAAAACAAATTCTATAGAATCCTTTAAAATCTGATCGACCCATAAAATTATATATAGGAACTCTTAACAATTGTTCATACCGAGACCCAAAACTTTCCCCATGATGGCATTGCCACCAAACCACCATGGTCGAGGATCCATTTTCGGCCTCTATCCATGGCCCTATCTTCACCATCTTCGAGCCCCTCTCCAAGTATCCTCAAGGCAATATAGCTCAAAGCTGAGCCAAACATTGTGCTGTCTCCCACTATGTGAAAACTCCATCCTCCATCTTCATTCTGTCCAACAAACACAGTAAAATACACACACACACACACATGAATATATATATATATATTTAATATCTACCCAAAATATATCTCAGCTTCCTTGAACTTTTATTGAATCCAAACCTGAGTATTATATAGGTATCGAATGATTTCCTTCCGATGATGTGATGAGAACATGCGATCGAGATCCCCAGTAATAGACAATGCCATCACCTGCAAATGGTAGTAACTTCTATTTAAGTTTTCTACAGCAAATAACTTACCAGGATGATTTATTTATAAGCCATTTAAATAAAACAAAACATAATCATTATAAAATAATACTCTTATAACTCTTTTATTTGAATAAAAAATGCCATACCAAGGGCCCAACAAAAACCAAGGGTCCACCAAATTCAGCAGGCCAGTGGCCATCATGGGCCTGAAGGGAGGAAGTGGAGCTTAGTGCTCTTCTCAGTGTAGTTGTCACTGCTTCCTCTGTTATTTCCTCTGTTTCTTGGACTTTCACTGGTGGTGGAATTTGCCCACGTTGATTCTCCTTTCTGATCTGCAAGTTCAAAAGCCAAAAATATAATTGGAATTTCACAACTGATCAAAATTATTCTAATCATGCAATATCATTAAGGGTTAGAAAACCATCTATCATGACTATATATATGAATAACGGGATAATTTATAGGATAATTATTATTTTTTAAAATCACTAAAATTTTCAATATTCTACCTCAAATTTAAAATTTACCTATTCATTGAGATTTGGTCATTAACATAAATAAAAAAATAAGTACTACAGATACAAAGAAATTATACAAAATTATCAAATCCACAAACTGATGTGGTTTTATTTGATCCGTTAGATGTATTTTATAATAAAAATAACTTTACAATCTGACGTATAACATCAAGCCATATCAGTTTGTAAGTTTATTTTTATGTAATTTTTTTATGGCTAAACTATTTCTCTTGTAATTATTTTTTATTATTATTCATGGACAAAAAATGAAGTGACGTTATAGGTAATAAAATATTTTCAATTTTTTATTCACTTATGTGATTCCAGCTTCGAGGGGAATAAAGATGATTATCCTAGGAAATATAAATTGCCAGAAAAGAAGGGAGGGGTGGGAGGAGATGAGATGATCTTTGCATGTGAGGTCGTTAGTGCATGGGAAAATGTGGGTGTAGGAAAACATGGGTGTGCATTTGATCCAACCTTGAGTATGAGTGACTTGGAAGTCTGTTCTTGCATGCTAATTCTATACTCTGTTTCAAGGCAGAAATACTCATCCAGATCACCCTTTCCTCTCTATCGTAGTTTGCGGACTTGTGTATGCATGGGGACAAGGGACATGATGAGATATGATATTTGCAATCGTGGAATACTGCATAAGTGTAGTACAAACTTTTTGAAAAAGTAAGTAAATATGAGATTTACATAAAAAAATTAATTTTTTAATAATAGAAACGTTGAGTACCACACTCTTTTTTAAAATGATTACATGGTGCTTATGCACTACACGACTTCACATAGCATTACTCTTTTTTTAAAAATATTTATTAAAAAAAAACTTGAGCAAATCTTGCAACCATATTGAGGTACGAGTTTTTCACTCCGAAACTTTTTATCGTACTATATATCATGATGGTCTCATGATATATATGTATGTCTCGTCCACCATCATGATATATATGTATGTCTCGTCCACCATCATGATATATTGTATGTCTCGTCCCTAAGTCAAACGGAGTCGATCACATTTTGGCTATAATACCAATATTGCATCGAAGGCTATAAGGTTTCTCTCGAGGGTACGTATATACTTTGGCTTCTCTTTTTCCTTTAATATATGTGCATGGGTTGGTCGAATGCAATCACCTGTCCAATTTGATATATATACATGCATAGCGAAATACATTTGGTGTAGTAAATGCATGTGCACTTCAAATATATAGTATCTCTAGTACTCTTATATCATGCATGGTATTCAAGAAATTACTACTATAAGAGATTATCAGATCAGAAAAAGGAATAAAGGGAAAAAAGATCAAGAGAAGAATTAACCTGCATTCTCATCAAAAGATCACAACTTTGTTTCATCTTAAACCGATTTTTCTTGTATTCCTCACGGACCCTTTCAACTTCAGCATGTTCTTCAGGTGTACCGGCATTAGGGTCGAATTCCCAGTGTTCTCGCCCGATGAAATTGTTTACGCTCACCAAATCAGGGCCTCCTTGGGATACTTTCAACTTCCACATCCTGATGATCTATATATATATAAGTACTACAGATCATAACGTTCCTATATTCAGACAAGATCATCACTGATATATATAACCCTAAACAACTAGGAATTAGCATTTCATAAAAAGATGAGGGAATATATAGCTAACTAGGAATCTTCTCAAGAAAGGTTTAAGATAGTTATATGTCAAAAGTATTAATCATAATTAAATTACGTTACTTAAATTATTATAAAAAATTAAAAATTATCAATTGCAAGCCATGTGTAATTAAGCTCATGATAATCACTAGCTAGCTAGCTACGTACCCTCGTATGGAAGGAAGCTTGAACCTGAAATTAACTAGCATTAGCTAGGCACCTTGTTTTAGCGAAATTTTGAAAAACAGAAAAAGATTACACACGATGAATTAATTAATTATATATATATATATAAATATATATTATATTTGAATTTAGAAATTTATACGGCAAAACTAATTTTAAAAAATAGATTTAAAAAAAAGATTTAGGGTAAGATTGTAATTTTAATAGGAACTTGATCAATATATATGATTTCTTCTACAAAATATGACTATTATAGACTTAATTAGGGTTGTTAGGGGGGTTATAATATTCATTTTTTTTTCGGCAAAAGGGAACAAAAGCCTAAAAAAATAAAAAATTGAGGAGCCTTGGCCCCCCAACCGTCCCTGATCTTTACATGAGTGCAAGGAGCATGGCTTGTTGGCTGGAATGAACGAACTATACATGCATGCATATGAGATGGATTGGATGGAAATGATCATCATGATTTGCATATTTTGATCCCTCATCAACATCGATCATACTCTTTAGTCGAAAGTATCATCAGCAAATATTATTATATACATATATATATATATATAATGCAAATTATATAATTAAGAAAGATCAGAACAATCAAATTAATTTGTTGCCATGAAAAGTTGAGATCGAGCGAGAAATGAATGGAAAGAAGCAGAGAGATATTTCTTGAATACCTTTAAATTAGCTTAGAGATCAGCAAATTTATAAGAGGCCGGTCGAGGAAGAACACATTCGCTAGCTACTTCTTGAGATGCATGCAATAATATTAGCACAAGTAGTACTGCTAGCACTTGGCCTTTGGTAGAGGCTTATCCCCTTT
And we annotated these two homologous genes (A and B) using the merged denovo libraries and found that both could be annotated to Mutator (from one of the sets of denovo libraries), but using the ancestral whole genome did not annotate to Mutator at this gene A.
…------------------ 原始邮件 ------------------
发件人: "rmhubley/RepeatMasker" ***@***.***>;
发送时间: 2023年7月22日(星期六) 凌晨2:14
***@***.***>;
***@***.******@***.***>;
主题: Re: [rmhubley/RepeatMasker] The relationship between the location of Mutator transposons and exons (Issue #225)
Could you provide an example with the alignment data?
—
Reply to this email directly, view it on GitHub, or unsubscribe.
You are receiving this because you authored the thread.Message ID: ***@***.***>
|
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
We've observed that Mutator transposons annotated by RepeatMasker often overlap with part of the exons in a gene, meaning the exons frequently become part of the internal sequence of Mutator. However, when we extract Mutator sequences individually, we don't find any significant, specific structural features.
Thus, we are curious about the principles behind the annotation of Mutator transposons.
Could there have been an error in the annotation of Mutators?
The text was updated successfully, but these errors were encountered: