Training Data (includes tutorial, example) #28
RADseq technology
Genetic map
Population genomics
There is a reference genome in the shared data, so the analysis can be run through the denovo_map as well as the ref_map pipelines.
Assemble read pairs
How do I add the sample datasets that I have with me?
@devikaatgit the best way is to upload the data to a Galaxy history on https://usegalaxy.org/. Then, share your history publicly. If you don't have an account, don't hesitate to create one, it's free ;)
@devikaatgit If you need help, let us know.
Tutorials for RNA-seq, Assembly and Variant calling using small publicly available datasets: Galaxy_Walkthrough.pdf
Thank you very much @Eduardo-Alves! In the meantime, I'm not sure I can use your material because of:
@frederikcoppens Do you think there is a way to create shared libraries for our group on the Galaxy main server?
@yvanlebras That's one of the possibilities and my personal favorite, but it needs to be discussed.
Bacterial RNA-seq data are available at the following URL: http://54.158.166.52/u/aida/h/datahackathonab
I posted some "advanced training sets" in the #30 post. They include larger data sets for RNAseq and RADseq, all from publicly accessible data, that highlight typical issues in data analysis and use published analyses. They might be good as a kind of second-pass training set for each of these analyses, as they may not be as straightforward as a "toy set", since they are real data with real issues. I have not uploaded the data yet, because I have to transfer it from the cluster to my computer and back up to Galaxy (unless anyone knows a faster means of doing this?).
Two-condition datasets (single replicates only) for RNA-seq of bacteria can be accessed at https://usegalaxy.org/u/devikasub/h/bacterial-rna-seq-2-condition-single-replicate-datasets
Thanks everyone! What is next? We need a list ticket for to-do items. It can reference this ticket and others. Would someone like to draft one, or should I? Shared Lib on Main > assign to me. Moving the data above into that, organized and labeled, is also me (in collaboration with the authors above and in the master ticket). Should we use the hack mailing list to sync up on this and related items?
Contributors: @jennaj @griffinp @kpoterlo @yvanlebras @BoughAida @ssander5 @devikaatgit @cschu @tnabtaf @kmurat1
This issue is dedicated to the Training Data hackathon group. The idea is to gather sample data that can be used as examples, tutorials, etc. on Galaxy instances.
Please don't hesitate to add a comment with data links and a description ;)
Example:
RADseq technology
Genetic map
-parents
-progeny
Population genomics
If the data are not reachable through the web (personal data on your laptop, ...), the best way is to upload them to a Galaxy history on https://usegalaxy.org/
The idea is to meet after gathering the data and discuss which datasets are good, duplicated, or too big, before proposing actions such as: data directly shareable, data needing to be reduced, ...