Zhenzhen (Jen) Weng

I am a fifth year Ph.D student in Computational and Mathematical Engineering (ICME) at Stanford University where I am advised by Prof. Serena Yeung.

I am broadly interested in 3D computer vision and machine learning. Specifically, my current research efforts are focused on developing cutting-edge deep learning algorithms for perceiving and modeling human pose and behavior in the 3D world. I am alo interested in applying my work towards developing AI-assisted healthcare solutions.

Prior to my Ph.D, I received B.S. in Computer Science and B.S. in Mathematics from Carnegie Mellon University. I also previously worked for a fund manager on the East Coast.

Email  /  Linkedin  /  Twitter  /  Google Scholar


Feb, 2023: :sparkles: 2 Papers accepted to CVPR 2023. :sparkles:

Jan, 2023: :sparkles: Joining Adobe Research as a Research Scientist Intern this summer. :sparkles:

Research and Publications
Diffusion HPC: Generating Synthetic Images with Realistic Humans
Zhenzhen Weng, Laura Bravo, Serena Yeung
Preprint | Website | Code

Recent text-to-image generative models such as Stable Diffusion often struggle to preserve plausible human structure in the generations. We propose Diffusion model with Human Pose Correction (Diffusion-HPC), a method that generates photo-realistic images with plausible posed humans by injecting prior knowledge about human body structure. The generated image-mesh pairs are well-suited for downstream human mesh recovery task.

3D Human Keypoints Estimation from Point Clouds in the Wild without Human Labels
Zhenzhen Weng, Alexander S. Gorban, Jingwei Ji, Mahyar Najibi, Yin Zhou, Dragomir Anguelov
Conference on Computer Vision and Pattern Recognition (CVPR), 2023

We propose GC-KPL - Geometry Consistency inspired Key Point Leaning. By training on the large WOD training set without any annotated keypoints, we attain reasonable performance as compared to the fully supervised approach. Further, the backbone benefits from the unsupervised training and is useful in downstream fewshot learning of keypoints, where fine-tuning on only 10 percent of the labeled training data gives comparable performance to fine-tuning on the entire set.

NeMo: 3D Neural Motion Fields from Multiple Video Instances of the Same Action
Kuan-Chieh Wang, Zhenzhen Weng, Maria Xenochristou, Joao Pedro Araujo, Jeffrey Gu, C. Karen Liu, Serena Yeung
Conference on Computer Vision and Pattern Recognition (CVPR), 2023
Paper | Website

We aim to bridge the gap between monocular HMR and multi-view MoCap systems by leveraging information shared across multiple video instances of the same action. We introduce the Neural Motion (NeMo) field. It is optimized to represent the underlying 3D motions across a set of videos of the same action.

Domain Adaptive 3D Pose Augmentation for In-the-wild Human Mesh Recovery
Zhenzhen Weng, Kuan-Chieh (Jackson) Wang, Angjoo Kanazawa, Serena Yeung
International Conference on 3D Vision (3DV), 2022
Paper | Project Page | Code

We propose Domain Adaptive 3D Pose Augmentation (DAPA), a data augmentation method that combines the strength of methods based on synthetic datasets by getting direct supervision from the synthesized meshes.

Holistic 3D Human and Scene Mesh Estimation from Single View Images
Zhenzhen Weng, Serena Yeung
Conference on Computer Vision and Pattern Recognition (CVPR), 2021

We propose a holistically trainable model that perceives the 3D scene from a single RGB image, estimates the camera pose and the room layout, and reconstructs both human body and object meshes.

Unsupervised Discovery of the Long-Tail in Instance Segmentation Using Hierarchical Self-Supervision
Zhenzhen Weng, Mehmet Giray Ogut, Shai Limonchik, Serena Yeung
Conference on Computer Vision and Pattern Recognition (CVPR), 2021

We propose a method that can perform unsupervised discovery of long-tail categories in instance segmentation, through learning instance embeddings of masked regions.

Slice-based learning: A programming model for residual learning in critical data slices
Vincent S Chen, Sen Wu, Zhenzhen Weng, Alexander Ratner, Christopher Ré
The Conference and Workshop on Neural Information Processing Systems (NeurIPS), 2019

We introduce the challenge of improving slice-specific performance without damaging the overall model quality, and proposed the first programming abstraction and machine learning model to support these actions.

Utilizing Weak Supervision to Infer Complex Objects and Situations in Autonomous Driving Data
Zhenzhen Weng, Paroma Varma, Alexander Masalov, Jeffrey Ota, Christopher Ré
IEEE Intelligent Vehicles Symposium (IEEE IV), 2019

We introduced weak supervision heuristics as a methodology to infer complex objects and situations by combining simpler outputs from current, state-of-the art object detectors.

Work Experience

Research Scientist Intern @ Adobe Research, San Jose, CA, Jun - Sept, 2023

Research Intern (Perception) @ Waymo, Mountain View, CA, Jun - Nov, 2022

Machine Learning Engineer @ VMware, Palo Alto, CA, Jun - Sept, 2019

Research Engineer @ AQR, Greenwich, CT, 2016 - 2018

Webpage template and source code from Jon Barron.