Pandas FASTA/FASTQ reader #1

Open
luizirber opened this issue Apr 15, 2014 · 2 comments

Comments

@luizirber

This is a problem I stumbled on a few days ago. A specialized reader for FASTA/FASTQ files would be useful, instead of parsing the files some other way and then loading the result into a DataFrame.

One possible problem is the lack of a formal specification for these file formats, but a good start would be to read just the sequence name and content from FASTA (and the quality from FASTQ).
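
To make the request concrete, here is a minimal sketch of what such a reader could look like. The function names (`read_fasta`, `read_fastq`) and the column names are hypothetical, not an existing pandas or biopandas API. It assumes plain-text input and, for FASTQ, the common four-lines-per-record layout (no wrapped sequence or quality lines).

```python
import io

import pandas as pd


def iter_fasta(handle):
    """Yield (name, sequence) pairs from an open FASTA text handle."""
    name, chunks = None, []
    for line in handle:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # A new header closes the previous record, if there was one.
            if name is not None:
                yield name, "".join(chunks)
            name, chunks = line[1:], []
        elif name is not None:
            # Sequence lines may be wrapped, so collect them and join later.
            chunks.append(line)
    if name is not None:
        yield name, "".join(chunks)


def iter_fastq(handle):
    """Yield (name, sequence, quality) triples, assuming 4 lines per record."""
    while True:
        header = handle.readline()
        if not header:
            break  # end of file
        if not header.strip():
            continue  # tolerate blank lines between records
        if not header.startswith("@"):
            raise ValueError("expected '@' at start of record: %r" % header)
        sequence = handle.readline().strip()
        handle.readline()  # '+' separator line, ignored here
        quality = handle.readline().strip()
        yield header.strip()[1:], sequence, quality


def read_fasta(handle):
    """Hypothetical reader: FASTA -> DataFrame with name and sequence."""
    return pd.DataFrame(list(iter_fasta(handle)), columns=["name", "sequence"])


def read_fastq(handle):
    """Hypothetical reader: FASTQ -> DataFrame with name, sequence, quality."""
    return pd.DataFrame(
        list(iter_fastq(handle)), columns=["name", "sequence", "quality"]
    )


# Usage with in-memory data; a real call would pass an open file instead.
fasta_df = read_fasta(io.StringIO(">seq1 demo\nACGT\nTTGA\n>seq2\nGGCC\n"))
fastq_df = read_fastq(io.StringIO("@r1\nACGT\n+\nIIII\n"))
```

Reading FASTQ by line position rather than by scanning for `@` matters because `@` is also a valid quality character, so a quality line can start with it.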

@marcelcaraciolo
Contributor

Thanks @luizirber for your feedback! I have already used pandas for some data analysis on our genomic data, especially BED files, and the results were quite promising! I couldn't find any Python package focused on genomic data analysis. Since I am a huge pandas fan and we routinely use it to analyse our data, we decided to start a spin-off of pandas called biopandas, focused on biological data. I have some scripts here and I will start committing them this weekend. Your request is already on our list. Please open more issues so we can discuss them here! The project will be open source, and any suggestions or collaboration are welcome!

@ChillarAnand

Any updates?
