New Research Alert - HeadGAP (Avatars Collection)!
Title: HeadGAP: Few-shot 3D Head Avatar via Generalizable Gaussian Priors
Description: HeadGAP introduces a novel method for generating high-fidelity, animatable 3D head avatars from few-shot data, using Gaussian priors and dynamic part-based modelling for personalized and generalizable results.
New Research Alert - ECCV 2024 (Avatars Collection)!
Title: Expressive Whole-Body 3D Gaussian Avatar
Description: ExAvatar is a model that generates animatable 3D human avatars with facial expressions and hand movements from short monocular videos using a hybrid mesh and 3D Gaussian representation.
Authors: Gyeongsik Moon, Takaaki Shiratori, and @psyth
New Research Alert - ECCV 2024 (Avatars Collection)!
Title: MeshAvatar: Learning High-quality Triangular Human Avatars from Multi-view Videos
Description: MeshAvatar is a novel pipeline that generates high-quality triangular human avatars from multi-view videos, enabling realistic editing and rendering through a mesh-based approach with physics-based decomposition.
Authors: Yushuo Chen, Zerong Zheng, Zhe Li, Chao Xu, and Yebin Liu
Plugin Names:
1. WebSearch: Searches the web using search engines.
2. Calculator: Evaluates mathematical expressions, extending the base Tool class.
3. WebBrowser: Extracts and summarizes information from web pages.
4. Wikipedia: Retrieves information from Wikipedia using its API.
5. Arxiv: Searches and fetches article information from Arxiv.
6. WolframAlphaTool: Provides answers on math, science, technology, culture, society, and everyday life.
These plugins currently work with the GPT-4o-2024-08-06 model, which also supports image analysis.
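As a rough illustration of what "extending the base Tool class" could look like for the Calculator plugin, here is a minimal Python sketch. The `Tool` base class, its `run` method, and the helper `_evaluate` are hypothetical names chosen for this example; the actual NiansuhAI interface is not shown in the post. The sketch evaluates arithmetic through Python's `ast` module with a whitelist of operators, rather than calling `eval` on user input.

```python
import ast
import operator
from abc import ABC, abstractmethod


class Tool(ABC):
    """Hypothetical base class; the real plugin interface may differ."""

    name: str = "tool"
    description: str = ""

    @abstractmethod
    def run(self, query: str) -> str:
        """Run the tool on a text query and return a text result."""


# Only these operators are allowed, so arbitrary code is never executed.
_BINARY_OPS = {
    ast.Add: operator.add,
    ast.Sub: operator.sub,
    ast.Mult: operator.mul,
    ast.Div: operator.truediv,
}
_UNARY_OPS = {ast.UAdd: operator.pos, ast.USub: operator.neg}


def _evaluate(node: ast.AST):
    """Recursively evaluate a parsed arithmetic expression."""
    if isinstance(node, ast.Expression):
        return _evaluate(node.body)
    if (
        isinstance(node, ast.Constant)
        and isinstance(node.value, (int, float))
        and not isinstance(node.value, bool)
    ):
        return node.value
    if isinstance(node, ast.BinOp) and type(node.op) in _BINARY_OPS:
        return _BINARY_OPS[type(node.op)](_evaluate(node.left), _evaluate(node.right))
    if isinstance(node, ast.UnaryOp) and type(node.op) in _UNARY_OPS:
        return _UNARY_OPS[type(node.op)](_evaluate(node.operand))
    raise ValueError("unsupported expression")


class Calculator(Tool):
    name = "Calculator"
    description = "Evaluates mathematical expressions."

    def run(self, query: str) -> str:
        try:
            return str(_evaluate(ast.parse(query, mode="eval")))
        except (SyntaxError, ValueError, ZeroDivisionError) as exc:
            return f"error: {exc}"
```

Under these assumptions, `Calculator().run("2 * (3 + 4)")` returns the string `"14"`, and anything outside plain arithmetic (function calls, names, attribute access) comes back as an error string instead of being executed.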
Introducing Plugins in NiansuhAI (on July 20, 2024)
Plugin Names:
1. WebSearch: Tool for searching the web using search engines.
2. Calculator: Helps evaluate mathematical expressions; extends the base Tool class.
3. WebBrowser: Interacts with web pages to extract information or summarize content.
4. Wikipedia: Retrieves data from Wikipedia using its API.
5. Arxiv: Searches and fetches article information from Arxiv.
6. WolframAlphaTool: Answers questions on math, science, technology, culture, society, and everyday life.
New Research Alert - CVPR 2024 (Avatars Collection)!
Title: IntrinsicAvatar: Physically Based Inverse Rendering of Dynamic Humans from Monocular Videos via Explicit Ray Tracing
Description: IntrinsicAvatar is a method for extracting high-quality geometry, albedo, material, and lighting properties of clothed human avatars from monocular videos using explicit ray tracing and volumetric scattering, enabling realistic animations under varying lighting conditions.
Authors: Shaofei Wang, Božidar Antić, Andreas Geiger, and Siyu Tang
Conference: CVPR, Jun 17-21, 2024 | Seattle, WA, USA
New Research Alert - ECCV 2024 (Avatars Collection)!
Title: RodinHD: High-Fidelity 3D Avatar Generation with Diffusion Models
Description: RodinHD generates high-fidelity 3D avatars from portrait images using a novel data scheduling strategy and weight consolidation regularization to capture intricate details such as hairstyles.
New Research Alert - LivePortrait (Avatars Collection)!
Title: LivePortrait: Efficient Portrait Animation with Stitching and Retargeting Control
Description: LivePortrait is an efficient video-driven portrait animation framework that uses implicit keypoints and stitching/retargeting modules to generate high-quality, controllable animations from a single source image.
Authors: @cleardusk, Dingyun Zhang, Xiaoqiang Liu, Zhizhou Zhong, Yuan Zhang, Pengfei Wan, and Di Zhang
New Research Alert (Avatars Collection)!
Title: Expressive Gaussian Human Avatars from Monocular RGB Video
Description: The new EVA model enhances the expressiveness of digital avatars by using 3D Gaussians and SMPL-X to capture fine-grained hand and face details from monocular RGB video.
Authors: Hezhen Hu, Zhiwen Fan, Tianhao Wu, Yihan Xi, Seoyoung Lee, Georgios Pavlakos, and Zhangyang Wang
New Research Alert - ECCV 2024 (Avatars Collection)!
Title: Topo4D: Topology-Preserving Gaussian Splatting for High-Fidelity 4D Head Capture
Description: Topo4D is a novel method for automated, high-fidelity 4D head tracking that optimizes dynamic topological meshes and 8K texture maps from multi-view time-series images.
Authors: @Dazz1e, Y. Cheng, @Ryan-sjtu, H. Jia, D. Xu, W. Zhu, Y. Yan
New Research Alert - Portrait4D-v2 (Avatars Collection)!
Title: Portrait4D-v2: Pseudo Multi-View Data Creates Better 4D Head Synthesizer
Description: Portrait4D-v2 is a novel method for one-shot 4D head avatar synthesis using pseudo multi-view videos and a vision transformer backbone, achieving superior performance without relying on 3DMM reconstruction.
Authors: Yu Deng, Duomin Wang, and Baoyuan Wang
New Research Alert - CVPRW 2024 (Facial Expressions Recognition Collection)!
Title: Zero-Shot Audio-Visual Compound Expression Recognition Method based on Emotion Probability Fusion
Description: AVCER is a novel audio-visual method for compound expression recognition based on a pairwise sum of emotion probabilities. It is evaluated in multi- and cross-corpus setups without task-specific training data, demonstrating its potential for intelligent emotion annotation tools.
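To make the "pairwise sum of emotion probabilities" idea concrete, here is a minimal Python sketch. It assumes that a model trained only on basic emotions has already produced one probability per basic emotion, and it scores each compound expression by adding the probabilities of its two constituent emotions. The label set, the pair table `COMPOUND_PAIRS`, and the function names are illustrative assumptions, not the authors' code; the audio-visual fusion step from the paper is not reproduced here.

```python
# Illustrative mapping from compound expressions to pairs of basic emotions.
# The exact label set is an assumption made for this sketch.
COMPOUND_PAIRS = {
    "fearfully surprised": ("fear", "surprise"),
    "happily surprised": ("happiness", "surprise"),
    "sadly surprised": ("sadness", "surprise"),
    "disgustedly surprised": ("disgust", "surprise"),
    "angrily surprised": ("anger", "surprise"),
    "sadly fearful": ("sadness", "fear"),
    "sadly angry": ("sadness", "anger"),
}


def compound_scores(basic_probs):
    """Score each compound class as the sum of its two basic-emotion probabilities."""
    return {
        name: basic_probs[first] + basic_probs[second]
        for name, (first, second) in COMPOUND_PAIRS.items()
    }


def predict_compound(basic_probs):
    """Return the compound class with the highest pairwise-sum score."""
    scores = compound_scores(basic_probs)
    return max(scores, key=scores.get)


# Example: probabilities over basic emotions (made-up numbers for illustration).
example_probs = {
    "neutral": 0.05,
    "anger": 0.05,
    "disgust": 0.05,
    "fear": 0.30,
    "happiness": 0.05,
    "sadness": 0.10,
    "surprise": 0.40,
}
print(predict_compound(example_probs))
```

Because the compound scores are derived purely from basic-emotion outputs, no compound-labelled training data is needed, which is what makes a zero-shot setup of this kind possible.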
New Research Alert - CVPR 2024 (Avatars Collection)!
Title: Relightable Gaussian Codec Avatars
Description: Relightable Gaussian Codec Avatars is a method for creating highly detailed and relightable 3D head avatars that can animate expressions in real time and support complex features such as hair and skin with efficient rendering suitable for VR.
New Research Alert - InstructAvatar (Avatars Collection)!
Title: InstructAvatar: Text-Guided Emotion and Motion Control for Avatar Generation
Description: InstructAvatar is a novel method for generating emotionally expressive 2D avatars using text-guided instructions, offering improved emotion control, lip-sync quality, and naturalness. It uses a two-branch diffusion-based generator to predict avatars based on both audio and text input.
New Research Alert - YOLOv10!
Title: YOLOv10: Real-Time End-to-End Object Detection
Description: YOLOv10 improves real-time object detection by eliminating non-maximum suppression and optimizing the model architecture to achieve state-of-the-art performance with lower latency and computational overhead.
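For context on what is being eliminated, here is a minimal sketch of conventional greedy non-maximum suppression (NMS), the post-processing step that detectors typically run to discard overlapping duplicate boxes. This is not YOLOv10 code; it only illustrates the step the description says YOLOv10 avoids. Boxes are assumed to be `(x1, y1, x2, y2)` tuples, and the function names and the default threshold of 0.5 are choices made for this example.

```python
def iou(box_a, box_b):
    """Intersection over union of two (x1, y1, x2, y2) boxes."""
    x1 = max(box_a[0], box_b[0])
    y1 = max(box_a[1], box_b[1])
    x2 = min(box_a[2], box_b[2])
    y2 = min(box_a[3], box_b[3])
    inter = max(0.0, x2 - x1) * max(0.0, y2 - y1)
    area_a = (box_a[2] - box_a[0]) * (box_a[3] - box_a[1])
    area_b = (box_b[2] - box_b[0]) * (box_b[3] - box_b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def greedy_nms(boxes, scores, iou_threshold=0.5):
    """Return indices of boxes kept after greedy NMS, highest score first."""
    order = sorted(range(len(boxes)), key=lambda i: scores[i], reverse=True)
    keep = []
    for i in order:
        # Keep a box only if it does not overlap too much with any kept box.
        if all(iou(boxes[i], boxes[j]) <= iou_threshold for j in keep):
            keep.append(i)
    return keep


boxes = [(0, 0, 10, 10), (1, 1, 11, 11), (20, 20, 30, 30)]
scores = [0.9, 0.8, 0.7]
print(greedy_nms(boxes, scores))  # [0, 2]
```

The loop compares each candidate against every box already kept, so its cost grows with the number of detections and it adds latency after the network's forward pass. An end-to-end detector that emits one box per object can skip this step.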
New Research Alert - Gaussian Head & Shoulders (Avatars Collection)!
Title: Gaussian Head & Shoulders: High Fidelity Neural Upper Body Avatars with Anchor Gaussian Guided Texture Warping
Description: Gaussian Head & Shoulders is a method for creating high-fidelity upper body avatars by integrating 3D morphable head models with a neural texture warping approach to overcome the limitations of Gaussian splatting.
New Research Alert - CVPR 2024!
Title: RoHM: Robust Human Motion Reconstruction via Diffusion
Description: RoHM is a diffusion-based approach for robust 3D human motion reconstruction from monocular RGB(-D) videos, effectively handling noise and occlusions to produce complete and coherent motions. This method outperforms current techniques in various tasks and is faster at test time.
Authors: Siwei Zhang et al.
Conference: CVPR, Jun 17-21, 2024 | Seattle, WA, USA