Stars
Mobile-Agent: The Powerful GUI Agent Family
Building a comprehensive and handy list of papers for GUI agents
LLaMA 3 is one of the most promising open-source models after Mistral; we will recreate its architecture in a simpler manner.
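As a taste of what "simpler" means here, below is a minimal PyTorch sketch of one LLaMA-family building block (RMSNorm, which replaces LayerNorm). It is illustrative only, not code from the repo.

```python
# Minimal RMSNorm sketch in the LLaMA style (illustrative, not the repo's code).
import torch
import torch.nn as nn

class RMSNorm(nn.Module):
    def __init__(self, dim: int, eps: float = 1e-6):
        super().__init__()
        self.eps = eps
        self.weight = nn.Parameter(torch.ones(dim))  # learnable per-channel scale

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # Normalize by the root-mean-square of the last dimension; no mean subtraction.
        rms = torch.rsqrt(x.pow(2).mean(dim=-1, keepdim=True) + self.eps)
        return self.weight * (x * rms)

x = torch.randn(2, 8, 16)
print(RMSNorm(16)(x).shape)  # torch.Size([2, 8, 16])
```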
GUI Dataset Collector: A Tool for Capturing and Annotating GUI Interactions with annotations in COCO format
This repo contains the Hugging Face Deep Reinforcement Learning Course.
Qwen3 is the large language model series developed by the Qwen team at Alibaba Cloud.
Seamless operability between C++11 and Python
📚A curated list of Awesome LLM/VLM Inference Papers with Codes: Flash-Attention, Paged-Attention, WINT8/4, Parallelism, etc.🎉
Integrating SSE with NVIDIA Triton Inference Server using a Python backend and the Zephyr model. There is very little documentation on how to use NVIDIA Triton in streaming use cases (hard to find in their…
High-performance self-hosted photo and video management solution.
Efficient Triton Kernels for LLM Training
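For context on what a Triton kernel looks like, here is a toy element-wise kernel in the canonical Triton style; it is a generic sketch (needs a CUDA GPU to run), not one of this repo's fused LLM-training kernels.

```python
# Toy Triton kernel sketch: element-wise add over 1024-element blocks (illustrative only).
import torch
import triton
import triton.language as tl

@triton.jit
def add_kernel(x_ptr, y_ptr, out_ptr, n_elements, BLOCK_SIZE: tl.constexpr):
    pid = tl.program_id(axis=0)
    offsets = pid * BLOCK_SIZE + tl.arange(0, BLOCK_SIZE)
    mask = offsets < n_elements                 # guard out-of-bounds lanes
    x = tl.load(x_ptr + offsets, mask=mask)
    y = tl.load(y_ptr + offsets, mask=mask)
    tl.store(out_ptr + offsets, x + y, mask=mask)

def add(x: torch.Tensor, y: torch.Tensor) -> torch.Tensor:
    out = torch.empty_like(x)
    n = out.numel()
    grid = (triton.cdiv(n, 1024),)              # one program instance per block
    add_kernel[grid](x, y, out, n, BLOCK_SIZE=1024)
    return out
```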
Welcome to the Llama Cookbook! This is your go-to guide for building with Llama: getting started with inference, fine-tuning, and RAG. We also show you how to solve end-to-end problems using Llama mode…
Yi-1.5 is an upgraded version of Yi, delivering stronger performance in coding, math, reasoning, and instruction-following capability.
AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference.
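A rough sketch of how such a quantization pass is typically driven from Python is below; the model id, output path, and quant_config values are illustrative assumptions rather than settings taken from the repo's documentation.

```python
# Hedged sketch of a 4-bit AWQ quantization run with AutoAWQ (paths/config are assumptions).
from awq import AutoAWQForCausalLM
from transformers import AutoTokenizer

model_path = "mistralai/Mistral-7B-Instruct-v0.2"   # assumed example model
quant_path = "mistral-7b-instruct-awq"              # assumed output directory
quant_config = {"zero_point": True, "q_group_size": 128, "w_bit": 4, "version": "GEMM"}

model = AutoAWQForCausalLM.from_pretrained(model_path)
tokenizer = AutoTokenizer.from_pretrained(model_path, trust_remote_code=True)

model.quantize(tokenizer, quant_config=quant_config)  # AWQ calibration + weight packing
model.save_quantized(quant_path)
tokenizer.save_pretrained(quant_path)
```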
MiniCPM-V 4.5: A GPT-4o Level MLLM for Single Image, Multi Image and High-FPS Video Understanding on Your Phone
llama3 implementation one matrix multiplication at a time
Code for "WebVoyager: WebVoyager: Building an End-to-End Web Agent with Large Multimodal Models"
AI agent using GPT-4V(ision) capable of using a mouse/keyboard to interact with web UI
VisualWebArena is a benchmark for multimodal agents.
LLMs built upon Evol-Instruct: WizardLM, WizardCoder, WizardMath
Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
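The core idea is small enough to sketch: repeatedly count adjacent token-id pairs and merge the most frequent pair into a newly minted id. The snippet below is an illustrative reimplementation of that loop over raw bytes, not the repo's code.

```python
# Minimal BPE training loop sketch (illustrative, not the repo's implementation).
from collections import Counter

def get_pair_counts(ids):
    return Counter(zip(ids, ids[1:]))

def merge(ids, pair, new_id):
    out, i = [], 0
    while i < len(ids):
        if i < len(ids) - 1 and (ids[i], ids[i + 1]) == pair:
            out.append(new_id)   # replace the matched pair with the new token id
            i += 2
        else:
            out.append(ids[i])
            i += 1
    return out

def train_bpe(text: str, num_merges: int):
    ids = list(text.encode("utf-8"))           # start from raw bytes (ids 0..255)
    merges = {}
    for step in range(num_merges):
        counts = get_pair_counts(ids)
        if not counts:
            break
        pair = counts.most_common(1)[0][0]     # most frequent adjacent pair
        new_id = 256 + step                    # mint a new token id
        ids = merge(ids, pair, new_id)
        merges[pair] = new_id
    return merges, ids

merges, ids = train_bpe("low lower lowest", 5)
```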
MiniCPM4 & MiniCPM4.1: Ultra-Efficient LLMs on End Devices, achieving 3+ generation speedup on reasoning tasks
Uses the peft library to perform efficient 4-bit QLoRA fine-tuning of chatGLM-6B/chatGLM2-6B, then merges the LoRA model with the base model and quantizes the result to 4 bits.
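A hedged sketch of the 4-bit QLoRA setup with peft and bitsandbytes is below; the target module name and hyperparameters are assumptions for illustration, not values taken from the repo.

```python
# Sketch of a 4-bit QLoRA setup with peft/bitsandbytes (hyperparameters are assumptions).
import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig
from peft import LoraConfig, get_peft_model, prepare_model_for_kbit_training

bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.bfloat16,
)
model = AutoModelForCausalLM.from_pretrained(
    "THUDM/chatglm2-6b", quantization_config=bnb_config, trust_remote_code=True
)
model = prepare_model_for_kbit_training(model)

lora_config = LoraConfig(
    r=8, lora_alpha=32, lora_dropout=0.05,
    target_modules=["query_key_value"],   # assumed module name for ChatGLM-style blocks
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()

# After training, the LoRA weights can be folded back into the base model:
# merged = model.merge_and_unload()
```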
🎓 Path to a free self-taught education in Computer Science!



