Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

PCoA Plotly Plotting from combined counts-metadata DF #79

Open
skennedy8 opened this issue Jul 21, 2021 · 1 comment
Open

PCoA Plotly Plotting from combined counts-metadata DF #79

skennedy8 opened this issue Jul 21, 2021 · 1 comment
Assignees
Labels
module: analysis new issue Recently opened issue

Comments

@skennedy8
Copy link
Contributor

Description

A combined DF including both normalized counts and metadata is a convenient means of data analysis. The DF is often validated for samples having both types of data and facilitates the sorting of samples.

The issue involves modifying/extending existing PCoA plotting in moonstone to handle this format.

Additional information

To preserve existing code promote stability, an attempt will be made to use metadata columns to filter any combined DF,
where only counts are required in the pipeline.

@skennedy8 skennedy8 added the new issue Recently opened issue label Jul 21, 2021
@skennedy8 skennedy8 self-assigned this Jul 21, 2021
@skennedy8
Copy link
Contributor Author

The goal here has been changed to reflect experience while performing the analysis.
There is really not an easy means of combining COUNT and METADATA into a single DataFrame; there needs to be validation of sample with both counts and clinical data. Integrating clinical data also means dealing with missing values, data types and selecting variables of interest. This seems to be best accomplished on a case-by-case basis in individual notebooks using the available codebase for generating the distance matrices and performing visualizations.

A useful addition would be a DataFrame ' cleaner/validator' for the metadata.fr used in the visualize_pcoa function.
A second objective is to add a function to perform PERMANOVA using the distance matrix and metadata_df. Is this the right place for it?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
module: analysis new issue Recently opened issue
Projects
None yet
Development

No branches or pull requests

1 participant