🎯
Focusing
M.S. student in Multimodal LLMs | VLM alignment, Multimodal RAG, Efficient fine-tuning | Building practical open-source tools
-
Graduate Researcher @ Tsinghua University
- Beijing, China
-
00:33
(UTC +09:00)
Popular repositories Loading
-
mmrag-strategy-bench
mmrag-strategy-bench PublicLightweight benchmark for multimodal RAG retrieval strategies
Python
-
vlm-token-budget-lab
vlm-token-budget-lab PublicVisual token budget simulator for efficient VLM inference
Python
-
-
-
WAM-Diff
WAM-Diff PublicForked from fudan-generative-vision/WAM-Diff
WAM-Diff: A Masked Diffusion VLA Framework with MoE and Online Reinforcement Learning for Autonomous Driving
Python
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.