Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Re-evaluate impact of error correction #273

Open
standage opened this issue Jun 18, 2018 · 0 comments
Open

Re-evaluate impact of error correction #273

standage opened this issue Jun 18, 2018 · 0 comments

Comments

@standage
Copy link
Collaborator

Performing error correction drastically reduces the sequence content (specifically the number of distinct k-mers) in each data set, and accordingly the amount of memory required to track k-mer counts accurately. At one point we were pretty enthusiastic about this improvement, but abandoned it at one point since it led to some false negatives.

I think this decision was based on a small number of manually inspected variants (perhaps even 1), and not on overall statistics. And in any case all of the variants involved were SNVs, where our superiority is already marginal. We should re-investigate kevlar's performance on the latest simulations using error corrected data.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant