Xiaoqiang Shi
PhD student at University of Chinese Academy of Sciences. Multimodal LLMs / Agents / 3D Vision.
University of Chinese Academy of Sciences
Email: shixiaoqiang21@mails.ucas.ac.cn
I am a PhD student in Computer Application Technology at the University of Chinese Academy of Sciences. My current interests include multimodal large language models, agent systems, computer vision, and 3D Gaussian Splatting based avatar reconstruction.
My recent work spans two connected directions. On the foundation-model side, I have hands-on experience with decoder-only Transformer training pipelines, tokenizer and data construction, pretraining, SFT/LoRA fine-tuning, reinforcement learning training, sampling, KV cache, multimodal alignment, VQA/caption prototypes, and tool-using coding agents. On the vision side, I work on semantic segmentation, object detection, and high-fidelity animatable human/head avatars based on 3D Gaussian representations.
Before my PhD study, I received my B.Eng. degree in Mechanical Design, Manufacturing and Automation from University of Science and Technology Liaoning. I use this site to maintain my project portfolio, CV, publications, and research/engineering notes.
news
| Jul 03, 2026 | Launched the first cleaned-up version of this personal homepage. |
|---|
latest posts
selected publications
- PR3DGA: 3D Avatar Animation from Monocular Video via Deformable Gaussian SplattingPattern Recognition, Jan 2026
- TCSVTBSSNet: A Real-Time Semantic Segmentation Network for Road Scenes Inspired from AutoEncoderIEEE Transactions on Circuits and Systems for Video Technology, Oct 2023
- MMCANVAS: Any-Shot Animatable 3D Gaussian Head Avatars from 1-K ImagesSubmitted to ACM Multimedia, 2026