Skip to content
View viochris's full-sized avatar
๐ŸŽฏ
Learning
๐ŸŽฏ
Learning

Block or report viochris

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this userโ€™s behavior. Learn more about reporting abuse.

Report abuse
viochris/README.md

Hi there, I'm Vio! ๐Ÿ‘‹

A Data Science Enthusiast & Computer Science Student

Coding Animation ย  Real Coding

I am a 5th-semester Informatics Engineering student at Universitas Dian Nuswantoro (UDINUS), Indonesia. My journey is driven by curiosityโ€”turning raw data into meaningful stories. I specialize in mining insights from structured numbers (Tabular) and unstructured language (Text/NLP).


๐Ÿ“š Table of Contents


๐Ÿง What I'm Focusing On

  • Data Science: Exploratory Data Analysis (EDA), Feature Engineering, and Model Building.
  • NLP (Natural Language Processing): Text Classification, Sentiment Analysis, and Language Modeling.
  • Tabular Data: Handling imbalanced datasets (SMOTE/SMOTENC), preventing data leakage, and interpretability.

๐Ÿ› ๏ธ Tech Stack & Tools

Languages: Python SQL

Data Science & Machine Learning: Pandas NumPy Scikit-Learn TensorFlow

NLP & GenAI Frameworks: Hugging Face LangChain

Web Framework (API): Flask FastAPI

Visualization: Matplotlib Seaborn

Environment & Tools: Google Colab Kaggle Jupyter Git


๐Ÿš€ Featured Projects

Here is a comprehensive list of my Data Science & AI projects, ranging from robust predictive modeling to API deployment and NLP exploration:

๐Ÿค– GenAI & Interactive Applications

Interactive AI agents and chatbots powered by RAG and LLMs.

Project Name Domain Key Tech Description
InsightSQL (ReAct Engine) GenAI / SQL Agent LangChain (ReAct), Gemini 2.5 Flash Advanced Reasoning Agent. Intelligent DBA using ReAct Architecture (Thought-Action-Observation) to reason through messy data. Features Glass Box AI transparency, self-correction, and multilingual business insights.
InsightSQL (Simple) GenAI / SQL Agent LangChain (SQL Agent), Gemini 2.5 Flash Zero-Shot SQL Engine. Streamlined Text-to-SQL assistant using standard create_sql_agent. Efficiently maps schema to queries with real-time execution feedback via Streamlit Callbacks.
Chat with Vio (Portfolio AI) GenAI / Portfolio Streamlit, LangChain, Gemini 2.5 Interactive CV. A personalized RAG assistant that "chats" with a professional profile. Features Strict Anti-Hallucination, bilingual support (EN/ID), and a responsive UI optimized for recruiters.
DocuTalk-AI (Smart RAG) GenAI / RAG LangChain, Gemini 2.5 Flash, FAISS Agentic Workflow. Intelligent document assistant using ReAct Agents to decide between Document retrieval (PDF, CSV, TXT, MD) (FAISS) or Web Search. Features Incremental Indexing (cost-efficient), strict language control, and session export.
DocuFlow-AI (Fast RAG) GenAI / RAG LangChain, Gemini 2.5 Flash, FAISS High-Performance Engine. Streamlined RAG using ConversationalRetrievalChain for instant, precise analysis (PDF, CSV, MD). Features Transparent Verification (Source & Score), incremental indexing, and zero agentic latency.
DocuChat AI (Long Context) GenAI / LLM Gemini 2.5 Flash, Streamlit, Python Full Context Approach. Analyzing entire PDFs without chunking (RAG alternative) using Gemini's massive context window. Features Strict Language Enforcement and real-time token monitoring.
AI Recipe Generator GenAI / Agent LangChain, Gemini, Streamlit Agentic App. Orchestrated a cooking assistant using LangChain ReAct Agents and Gemini 2.5 Flash to detect ingredients (Vision) and generate recipes with conversation memory.

๐Ÿ”Œ Backend & Deployment (MLOps)

High-performance APIs and production-ready inference engines.

Project Name Domain Key Tech Description
Portfolio API (Backend) Backend / API FastAPI, Docker, FAISS Headless RAG Engine. The high-performance backend serving the portfolio data. Features Swagger UI for testing, structured JSON responses with Source Citations, and fully Dockerized deployment on Hugging Face Spaces.
Stuntify API (MLOps) API / MLOps FastAPI, Pydantic, Scikit-Learn High-Performance Engine. A robust inference system using FastAPI to serve real-time stunting predictions. Features strict Type Coercion, multi-artifact orchestration, and interactive Swagger documentation.
Insightify API (NLP) NLP / API FastAPI, Transformers, Pandas Dual-Lingual Analytics. NLP microservice for English & Indonesian sentiment using RoBERTa. Features Batch Processing (Excel/CSV), N-Gram keyword extraction, and automated text statistics.
Stunting Prediction API API / MLOps Flask, Python Deployment. Serving the stunting prediction model as a RESTful API for real-time inference using Flask.

๐Ÿ“Š Data Science (Applied ML & Model Building)

In-depth data analysis, model training, NLP implementation, and academic research.

Project Name Domain Key Tech Description
Diabetes Prediction (Robust) Tabular Scikit-Learn, SMOTE Best Practice. Built a robust model ensuring no data leakage by applying Scaling & SMOTE after data splitting.
Stunting Analysis Tabular SMOTENC, Tree Viz Handling categorical imbalance with SMOTENC and visualizing decision paths (Pruned Tree) for interpretability.
Review Sentiment Analyzer NLP Transformers, Python Leveraging Hugging Face pre-trained models to accurately classify text sentiment (Positive/Negative).
Diabetes Paper Replication Academic Python, Pandas Research Study. Replicating a specific research paper's methodology (including its pre-split scaling technique) for comparative analysis.

๐Ÿ“ˆ Data Analytics & Visualization

Focusing on exploratory data analysis (EDA), business intelligence dashboards, and visual storytelling.

Project Name Domain Key Tech Description
Supermarket Sales Dashboard Retail Analytics Tableau Public Sales & Operations. Dashboard monitoring Total Sales, COGS, and Customer Ratings across major cities. Features breakdown by Payment Methods (E-wallet/Cash) and Product Line performance.
Bike Sales Dashboard Executive Sales Tableau Public Revenue Tracker. High-level executive view of Revenue and Unit Sales. Analyzes performance by US States, Brand Names, and identifies Top 10 High-Value Customers.

๐ŸŽฎ Experimental & Fun Projects

Creative coding projects exploring logic flows, real-time communication, and game-like interactions.

Project Name Domain Key Tech Description
Conversational AI Agent GenAI / Persona Streamlit, Gemini LLM Roleplay Chatbot. An engaging AI companion simulating a romantic partner using Google Gemini, featuring persistent memory and dynamic persona adaptation.
Real-time Chat Group Social / Socket Python, Networking Multi-User Chat. A casual project exploring real-time communication and socket programming principles to create a functional group chat environment.

โš ๏ธ Disclaimer: These projects are created purely for educational purposes and fun. They are provided "as-is" without warranty of any kind. The author is not responsible for any security vulnerabilities, data loss, or misuse resulting from the deployment of this code. Not intended for production use.


๐Ÿ“ˆ GitHub Stats

GitHub Streak Productive Time

๐Ÿ“ซ Connect with Me & Resume

Download CV ย  View CV
Email LinkedIn

"Data is the new oil, but itโ€™s useless without the engine to refine it."

Pinned Loading

  1. Conversational-AI-Agent Conversational-AI-Agent Public

    Python 1

  2. Simple-Sentiment-Analysis Simple-Sentiment-Analysis Public

    Python 1

  3. Diabetes-prediction-project Diabetes-prediction-project Public

    ๐Ÿ” Predict diabetes using ML + Streamlit โ€” Full pipeline with dashboard & joblib model!

    Jupyter Notebook 1

  4. Diabetes-prediction-fine-tuned-project Diabetes-prediction-fine-tuned-project Public

    ๐Ÿ” Predict diabetes using ML + Fine Tuning + Streamlit โ€” Full pipeline with dashboard & joblib model!

    Jupyter Notebook 1

  5. Stunting-prediction-project Stunting-prediction-project Public

    Jupyter Notebook 1

  6. API-Stuntify API-Stuntify Public

    Python 1