Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

steps for training #18

Open
brianwalenz opened this issue May 8, 2019 · 2 comments
Open

steps for training #18

brianwalenz opened this issue May 8, 2019 · 2 comments

Comments

@brianwalenz
Copy link

I like this approach to simulation; it lets me easily control the sequences simulated (e.g., letting me make chimeric reads, reads with garbage in the middle, etc) and, in theory, lets me try different base callers on those signals. My use case is to generate reads with various levels of junk in them to test assembly algorithms.

I'm confused about how training is accomplished. Two questions:

If I have a big pile of fast5 files, what steps do I need to convert that into inputs for training? I see I need to supply a 'rawsig' file and a 'fasta' file. Where do those come from?

In particular, if the fasta is the result of base calling the fast5, aren't you then training to make signal that will result in the correct sequence for that particular base caller?

@realbigws
Copy link
Collaborator

realbigws commented May 29, 2019 via email

@Merritt-Brian
Copy link

I am having the same problem as you, brianwalenz where I need to retrain from fast5 and fastq files to rawsig and a fasta. Has there been any progress on this particular issue?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants