Sample names with spaces are not supported #199
Comments
Would it be okay if I convert the " " to a "_"?
Can you email me this bam with 1/2 million alignments?
On Fri, Jul 21, 2017 at 2:32 PM, Dave Larson wrote:
I have unfortunately encountered a file with an SM tag containing a space.
I believe this is valid.
When running lumpyexpress on this file the resulting VCF truncates the
sample name at the space.
For example, the tag:
@RG ID:2895621816 CN:WUGSC LB:lib1 PL:ILLUMINA PU:XXXXXX.X SM:SAMPLE -50
would result in a VCF containing the sample SAMPLE instead of SAMPLE -50.
--
Ryan Layer
I think converting a space to an underscore would just put me in the same position I'm in now: a different sample name in the output file and poor interaction with tools that handle the sample name properly. Unfortunately, this is protected-access data, so I can't share it.
Can you send me the command that lumpyexpress tries to run? I was thinking that it may work if the -pe and -sr options were enclosed in double quotes.
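To illustrate the quoting idea above, here is a minimal Python sketch using `shlex`, which follows POSIX shell word-splitting rules. The option value is made up for illustration; the actual command that lumpyexpress builds is not shown in this thread, so this only demonstrates how an unquoted space splits one argument into two.

```python
import shlex

# Hypothetical option value (an assumption, not the real lumpyexpress command).
pe_value = "id:SAMPLE -50,bam_file:sample.discordants.bam"

# Built without quoting: the shell would split the value at the space.
unquoted = "lumpy -pe " + pe_value
# Built with quoting: shlex.quote wraps the value so it stays one argument.
quoted = "lumpy -pe " + shlex.quote(pe_value)

unquoted_args = shlex.split(unquoted)
quoted_args = shlex.split(quoted)

print(unquoted_args)  # the -pe value is broken into two arguments
print(quoted_args)    # the -pe value survives as a single argument
```

Note that `shlex.quote` uses single quotes rather than double quotes; either keeps the space from splitting the argument, as long as the value itself contains no quote characters.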
Here's what is printed in the log file.
I don't have the temp directory preserved, but I can rerun if that would help.
@ryanlayer - I think I may have a solution to this (#208) but I've not tested extensively and my bash is pretty weak. Let me know if you see any issues with what I put together.
I have unfortunately encountered a CRAM file with an SM field within the @RG lines containing a space. I believe this is valid according to the BAM specification.
When running lumpyexpress on this file the resulting VCF truncates the sample name at the space.
For example, the line:
@RG ID:2895621816 CN:WUGSC LB:lib1 PL:ILLUMINA PU:XXXXXX.X SM:SAMPLE -50
would result in a VCF containing the sample SAMPLE instead of SAMPLE -50.
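For reference, a minimal Python sketch of why the space matters when reading the header. SAM header fields are tab-separated, so splitting on tabs keeps the full SM value, while splitting on any whitespace cuts it at the space. Whether lumpyexpress truncates the name for this reason is an assumption; the function name `read_group_sample` is hypothetical, and the header line is written with explicit tabs.

```python
def read_group_sample(header_line):
    """Return the SM value from a SAM @RG header line, or None if absent."""
    fields = header_line.rstrip("\n").split("\t")  # SAM header fields are tab-separated
    if fields[0] != "@RG":
        return None
    for field in fields[1:]:
        tag, _, value = field.partition(":")  # split at the first colon only
        if tag == "SM":
            return value
    return None


line = "@RG\tID:2895621816\tCN:WUGSC\tLB:lib1\tPL:ILLUMINA\tPU:XXXXXX.X\tSM:SAMPLE -50"

# Tab-aware parsing keeps the space inside the sample name.
tab_aware = read_group_sample(line)

# Naive whitespace splitting (shown for contrast) loses everything after the space.
naive = [f for f in line.split() if f.startswith("SM:")][0][3:]

print(repr(tab_aware))
print(repr(naive))
```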