
Python_MNIST

Overview

This repository began as a simple neural network that recognizes handwritten digits (0-9), built from scratch using GitHub Codespaces, Claude Opus 4.5, and the MNIST dataset. The repo was then given to Claude Code with a mandate to significantly improve the model.

Result: model accuracy improved from ~80% to 98.5% on the MNIST test set through standard modern deep learning techniques.

Model Architecture

  • Input: 784 pixels (28x28 grayscale image)
  • Hidden layers: 4 layers with 512 → 384 → 256 → 128 neurons
  • Output: 10 neurons (digits 0-9) with softmax
  • Activation: ReLU with 30% dropout
  • Initialization: He initialization for improved gradient flow
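Below is a minimal NumPy sketch of the forward pass implied by the architecture above (layer sizes and the 30% dropout rate come from that list). Names like `forward` and `LAYER_SIZES` are illustrative and may differ from the actual class in network.py.

```python
import numpy as np

# Layer sizes from the architecture above: 784 -> 512 -> 384 -> 256 -> 128 -> 10.
LAYER_SIZES = [784, 512, 384, 256, 128, 10]
DROPOUT_RATE = 0.3  # 30% dropout on hidden activations

rng = np.random.default_rng(0)

# He initialization: weights ~ N(0, 2 / fan_in), biases start at zero.
weights = [rng.normal(0.0, np.sqrt(2.0 / n_in), size=(n_in, n_out))
           for n_in, n_out in zip(LAYER_SIZES[:-1], LAYER_SIZES[1:])]
biases = [np.zeros(n_out) for n_out in LAYER_SIZES[1:]]

def forward(x, training=True):
    """Forward pass: ReLU + inverted dropout on hidden layers, softmax on the output."""
    a = x
    for i, (w, b) in enumerate(zip(weights, biases)):
        z = a @ w + b
        if i < len(weights) - 1:              # hidden layer
            a = np.maximum(0.0, z)            # ReLU
            if training:                      # inverted dropout: rescale kept units
                mask = rng.random(a.shape) > DROPOUT_RATE
                a = a * mask / (1.0 - DROPOUT_RATE)
        else:                                 # output layer: softmax over 10 classes
            z = z - z.max(axis=-1, keepdims=True)   # numerical stability
            a = np.exp(z) / np.exp(z).sum(axis=-1, keepdims=True)
    return a

# Example: class probabilities for a batch of two random inputs
probs = forward(rng.random((2, 784)), training=False)
```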

Training Features

  • Mini-batch SGD (batch size 128) with data shuffling
  • Learning rate scheduling with decay at 50%, 70%, and 90% of training
  • 80 epochs with validation monitoring
  • Dropout regularization to prevent overfitting
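A sketch of how these training features fit together, assuming a step-decay schedule at the listed fractions of training. The base learning rate and the `train_step`/`evaluate` callbacks are placeholders, not values taken from train.py.

```python
import numpy as np

EPOCHS = 80
BATCH_SIZE = 128
BASE_LR = 0.1                     # illustrative starting learning rate (not documented above)
DECAY_POINTS = (0.5, 0.7, 0.9)    # decay at 50%, 70%, and 90% of training

def learning_rate(epoch):
    """Step decay: shrink the base rate by 10x at each decay point reached."""
    lr = BASE_LR
    for frac in DECAY_POINTS:
        if epoch >= int(frac * EPOCHS):
            lr *= 0.1
    return lr

def train(x_train, y_train, x_val, y_val, train_step, evaluate):
    """Mini-batch SGD with per-epoch shuffling and validation monitoring.

    `train_step(x_batch, y_batch, lr)` and `evaluate(x, y)` stand in for the
    network's backward pass and accuracy computation.
    """
    n = x_train.shape[0]
    rng = np.random.default_rng(0)
    for epoch in range(EPOCHS):
        lr = learning_rate(epoch)
        order = rng.permutation(n)                  # shuffle the data each epoch
        for start in range(0, n, BATCH_SIZE):
            idx = order[start:start + BATCH_SIZE]
            train_step(x_train[idx], y_train[idx], lr)
        val_acc = evaluate(x_val, y_val)            # validation monitoring
        print(f"epoch {epoch + 1:3d}  lr {lr:.4f}  val acc {val_acc:.4f}")
```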

Structure

  • network.py - Deep neural network class with forward/backward pass
  • train.py - Data loading, training loop with validation
  • predict.py - Image preprocessing and prediction with test-time augmentation
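For the test-time augmentation in predict.py, one common approach is to average the network's softmax outputs over small pixel shifts of the input. The exact augmentations used by the script are not documented here, so the SciPy-based sketch below (1-pixel shifts, `forward` as in the sketch above) is an assumption.

```python
import numpy as np
from scipy.ndimage import shift

def predict_with_tta(image_784, forward):
    """Average predictions over small shifts of the input (test-time augmentation)."""
    img = image_784.reshape(28, 28)
    offsets = [(0, 0), (1, 0), (-1, 0), (0, 1), (0, -1)]   # identity + 1-pixel shifts
    probs = np.zeros(10)
    for dy, dx in offsets:
        shifted = shift(img, (dy, dx), order=1, mode="constant", cval=0.0)
        probs += forward(shifted.reshape(1, 784), training=False)[0]
    probs /= len(offsets)
    return int(np.argmax(probs)), probs
```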

Dataset

MNIST - 70,000 labeled images of handwritten digits (28x28 grayscale)
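How train.py obtains and normalizes MNIST is not documented here, but a typical preprocessing step for this architecture flattens each 28x28 image into a 784-dimensional vector scaled to [0, 1] and one-hot encodes the labels, roughly as in this sketch:

```python
import numpy as np

def preprocess(images_uint8, labels):
    """Flatten 28x28 uint8 images to 784-dim floats in [0, 1]; one-hot encode integer labels."""
    x = images_uint8.reshape(len(images_uint8), 784).astype(np.float32) / 255.0
    y = np.eye(10, dtype=np.float32)[labels]     # shape (n, 10), one row per image
    return x, y
```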

Requirements

  • Python 3.x
  • NumPy
  • Pillow
  • SciPy
