Xiaoqiang Shi

PhD student at University of Chinese Academy of Sciences. Multimodal LLMs / Agents / 3D Vision.

prof_pic.jpg

University of Chinese Academy of Sciences

Email: shixiaoqiang21@mails.ucas.ac.cn

GitHub · Google Scholar

I am a PhD student in Computer Application Technology at the University of Chinese Academy of Sciences. My current interests include multimodal large language models, agent systems, computer vision, and 3D Gaussian Splatting based avatar reconstruction.

My recent work spans two connected directions. On the foundation-model side, I have hands-on experience with decoder-only Transformer training pipelines, tokenizer and data construction, pretraining, SFT/LoRA fine-tuning, reinforcement learning training, sampling, KV cache, multimodal alignment, VQA/caption prototypes, and tool-using coding agents. On the vision side, I work on semantic segmentation, object detection, and high-fidelity animatable human/head avatars based on 3D Gaussian representations.

Before my PhD study, I received my B.Eng. degree in Mechanical Design, Manufacturing and Automation from University of Science and Technology Liaoning. I use this site to maintain my project portfolio, CV, publications, and research/engineering notes.

news

Jul 03, 2026 Launched the first cleaned-up version of this personal homepage.

latest posts

selected publications

  1. PR
    3DGA: 3D Avatar Animation from Monocular Video via Deformable Gaussian Splatting
    Xiaoqiang Shi
    Pattern Recognition, Jan 2026
  2. TCSVT
    BSSNet: A Real-Time Semantic Segmentation Network for Road Scenes Inspired from AutoEncoder
    Xiaoqiang Shi
    IEEE Transactions on Circuits and Systems for Video Technology, Oct 2023
  3. MM
    CANVAS: Any-Shot Animatable 3D Gaussian Head Avatars from 1-K Images
    Xiaoqiang Shi
    Submitted to ACM Multimedia, 2026