Skip to content

EliasReis13/Apollo-Solutions-Machine-Learning-Developer-Test

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

15 Commits
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

Apollo Solutions Machine Learning Developer Test

A machine learning pipeline for analyzing genetic syndrome embeddings from image data. Developed for Apollo Solutions' ML Developer Practical Test.

πŸ“‹ Table of Contents

Installation πŸ“₯

  1. Clone the repository:
git clone https://github.com/yourusername/apollo-genetic-analysis.git
  1. Install dependencies
pip install -r requirements.txt

Project Structure πŸ—οΈ

genetic-syndrome-analysis/
β”œβ”€β”€ data/
β”‚   └── mini_gm_public_v0.1.p         # Raw dataset (embeddings)
β”œβ”€β”€ results/
β”‚   β”œβ”€β”€ plots/                        # Generated visualizations
β”‚   β”‚   β”œβ”€β”€ auc_comparison.png
β”‚   β”‚   β”œβ”€β”€ class_distribution.png
β”‚   β”‚   └── tsne_visualization.png
β”‚   β”œβ”€β”€ flattened_data.pkl            # Processed dataset
β”‚   └── knn_results.json              # Classification metrics
β”œβ”€β”€ scripts/
β”‚   β”œβ”€β”€ data_processing.py            # Data loading & preprocessing
β”‚   β”œβ”€β”€ eda.py                        # Exploratory data analysis
β”‚   β”œβ”€β”€ tsne_visualization.py         # Dimensionality reduction
β”‚   β”œβ”€β”€ knn_classification.py         # KNN implementation
β”‚   └── generate_plots.py             # Metric visualizations
β”œβ”€β”€ main.py                           # Main pipeline controller
β”œβ”€β”€ requirements.txt                  # Dependency list
└── README.md                         # This document

Usage 🚦

Full Pipeline Execution

python main.py

Individual Components

# Data preprocessing
python scripts/data_processing.py

# Generate EDA visualizations
python scripts/eda.py

# Create t-SNE plot
python scripts/tsne_visualization.py

# Run KNN classification
python scripts/knn_classification.py

# Generate performance plots
python scripts/generate_plots.py

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages