Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Statistics question]: UMAP #41

Open
haesleinhuepf opened this issue Oct 21, 2024 · 1 comment
Open

[Statistics question]: UMAP #41

haesleinhuepf opened this issue Oct 21, 2024 · 1 comment
Assignees

Comments

@haesleinhuepf
Copy link
Owner

Question:

Create a synthetic dataset of measurements of cells from microscopy images (intensity, aspect ratio, perimeter, elongation) as pandas dataframe. Then, determine a UMAP from these parameters, write the UMAP parameters back to the dataframe and visualize the UMAP using seaborn. Save the plot as png before showing it.

Python Tools

  • List of Python libraries we should use for answering this:
    • numpy
    • scipy
    • pandas
    • seaborn
    • umap-learn
    • scikit-learn

Note: Your images and the text you enter here may be sent to OpenAI's online service where we use a large language model to answer your request.
Do not upload any data you cannot share openly. Also do not enter any private or secret information. By submitting this Github issue, you confirm that you understand these conditions.

git-bob try to do this

@haesleinhuepf haesleinhuepf self-assigned this Oct 21, 2024
Repository owner deleted a comment from github-actions bot Oct 21, 2024
Copy link
Contributor

This message was generated by git-bob (version: 0.11.0-dev, model: gpt-4o-2024-08-06, log), an experimental AI-based assistant. It can make mistakes and has limitations. Check its messages carefully.

I addressed issue #41 by creating a synthetic dataset representing cell measurements in a pandas dataframe with parameters such as intensity, aspect ratio, perimeter, and elongation. I then applied UMAP (Uniform Manifold Approximation and Projection) to reduce the dimensionality of these parameters and appended the resulting UMAP components to the dataframe. Utilizing seaborn, I visualized the UMAP components and saved the plot as a PNG image named umap_projection.png, which demonstrates the clustering of cells based on their features.

  • umap_projection.png Graphical output of UMAP visualization displaying cell feature clustering.
  • analysis/umap_analysis.ipynb Created a Jupyter Notebook for performing UMAP analysis on synthetic cell measurements and visualizing the results.

@haesleinhuepf haesleinhuepf changed the title [Statistics question]: UMAP 7 [Statistics question]: UMAP Oct 22, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant