:octocat:
have a nice day
  • freelance work
  • Beijing

MichaelXcc/README.md

🚀 Building high-performance LLM inference platforms on Kubernetes



🛠 Tech Stack

💻 Programming Languages

☁️ Cloud Native & DevOps

🤖 LLM Engineering & Inference

🧩 AI Agents & RAG Workflows


🔭 What I'm Working On

Focusing on the intersection of Cloud Native and AI Infrastructure.

🚀 Core Projects

  • Building a K8s-native Inference Platform with vLLM & SGLang
  • Writing Custom Controllers in Go for GPU pooling
  • Implementing Volcano queue visualization & priority scheduling

⚡ Performance

  • Tuning H800/A800 GPU metrics for large-scale training
  • Optimizing BF16 matrix multiplication (GEMM)
  • Designing observability pipelines for Model Serving


Pinned

  1. infiniflow/ragflow Public

    RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs

    Python · 72.5k stars · 8k forks

  2. dify Public

    Forked from langgenius/dify

    Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting yo…

    TypeScript

  3. easy-dataset Public

    Forked from ConardLi/easy-dataset

    A powerful tool for creating fine-tuning datasets for LLM

    JavaScript

  4. LLaMA-Factory Public

    Forked from hiyouga/LlamaFactory

    Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

    Python

  5. kubernetes Public

    Forked from kubernetes/kubernetes

    Production-Grade Container Scheduling and Management

    Go

  6. volcano-sh/volcano Public

    A Cloud Native Batch System (Project under CNCF)

    Go · 5.3k stars · 1.3k forks