SarathL754

Pinned

  1. Reducing-Hallucinations-with-Direct-Preference-Optimization Public

    An RLHF-inspired DPO framework that explicitly teaches LLMs when to refuse, significantly reducing hallucinations.

  2. Decision-Transformer-from-Scratch-HalfCheetah-Minari-BC-vs-Return-Conditioned-DT Public

    Implementing Decision Transformers from scratch for offline RL, benchmarking return-conditioned policies against Behavior Cloning.

    Python

  3. VulneraAI-agent Public

    An agentic LLM security scanner that analyzes applications against OWASP Top 10 using tool-calling, LangGraph, and AWS Bedrock.

    Python

  4. Email-Assistant-langgraph Public

    Python

  5. Multi-agent-RL-texas-holdem-aec Public

    An engineering-focused multi-agent reinforcement learning system for Texas Hold’em using PettingZoo AEC and a custom PyTorch PPO self-play setup.

    Python

  6. Alzheimer-Disease-Stage-Classification-CNNs-vs-Transformers- Public

    A comparative study of CNNs vs. Vision Transformers for Alzheimer’s disease stage classification on brain MRI, with detailed error and performance analysis.

    Jupyter Notebook