-
Notifications
You must be signed in to change notification settings - Fork 9
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
seaborn-v0_8-dark issue persisting in v0.1.8, attempts to re-run without re-modelling yields different passed variants #28
Comments
Hi @rachwo , Thanks for reporting this issue. For the first matplotlib issue, can you please run the following in your python terminal and check if the your matplotlib version is up to date?
The motivation for v0.1.8 fix was to address this matplotlib update where the names changed and the old style name was no longer available. You can check with:
For the second issue, that is definitely not expected behaviour and would need some debugging. Can you please check if the new deltaBIC_cdf.pdf looks anything like your previous run? There can be a few possible things going wrong so I want to first check if the knee point is behaving normally. Thanks for your patience and sorry for the bug! |
Thanks for the response! My matplotlib version was older. I've since updated my python (v3.11) and matplotlib version (3.9.1) and am able to generate all expected output files (BIC_params.csv, debug_unsorted_BIC_params.csv, passed_ad.mtx, passed_dp.mtx, deltaBIC_cdf.pdf, top variants heatmap.pdf, and passed_variant_names.txt) using this code:
However, this time, there are more variants in the "passed_variant_names.txt" than what had been generated in my very first attempt (as in my initial post). As a second test, I attempted to rerun without remodelling using the following code:
I noticed again that the variant names are different than what was generated directly above. I've attached the output top variant heatmaps and deltaBIC cdf plots from both (first run, plus rerun). Hopefully this is helpful to you, but let me know if you need any other info or clarification. Unrelatedly, are the mitochondrial variant names in the variant_name column of debug_unsorted_BIC_params.csv named such that Thanks again for your help. Generated from first run (first codeblock in this post): Generated from the re-run (second codeblock in this post): |
Thank you so much for providing me with the files, they were very helpful in debugging. I have pushed a hotfix to the dev version of MQuad, can you please try to install from the repo directly and see if it solves the problem? The output of your second run should be the same as your first now (I am not entirely sure but that's what the dev branch is for). Also, you are right with the nomenclature of the variants. The names in MQuad correspond to your cellSNP output, however in cellSNP, if you did not provide a ref genome, the allele with the largest count is considered REF, which might cause some issue in certain positions where the REF and ALT are reversed, so I would suggest checking if you are making custom plots. It does not affect the modelling in MQuad though. |
Hi there!
After running cellsnp-lite, I attempted to run mquad using the following:
but get this issue:
The output directory contained the following files:
I was missing the passed_dp.mtx and passed_ad.mtx, and top variants heatmap.pdf so I attempted to re-run without re-fitting the model using:
This seemed to work and it generated
However, I noticed that the passed_variant_names.txt file contains a different list of variants than the first list. Why might that be? The variants in the top variants heatmap pdf match the variants in the passed_variant_names.txt file generated from the re-run attempt, but when I look at their scores* they seem to be incorrect. Is this expected behaviour?
*(I assume based on looking at the BIC_params.csv output. The passed_variant_names.txt generated from the initial run had a higher num_cells and deltaBIC. The passed variant names generated from the re-run pointed to variants that had deltaBIC <=0 and lower num_cells.)
Thanks in advance!
The text was updated successfully, but these errors were encountered: