Safetensors
valley
wuziheng Hyggge commited on
Commit
2b948bc
·
verified ·
1 Parent(s): 3092771

Update README.md (#3)

Browse files

- Update README.md (5b3f595b149012d64eda2411264a4e9fb357f06c)


Co-authored-by: Zhenghao Chen <Hyggge@users.noreply.huggingface.co>

Files changed (1) hide show
  1. README.md +42 -2
README.md CHANGED
@@ -4,8 +4,17 @@ base_model:
4
  - Qwen/Qwen2.5-7B-Instruct
5
  ---
6
  # Valley 2.0
 
 
 
 
 
 
 
 
 
7
  ## Introduction
8
- Valley [github](https://github.com/bytedance/Valley) is a cutting-edge multimodal large model designed to handle a variety of tasks involving text, images, and video data, which is developed by ByteDance. Our model not only
9
 
10
  - Achieved the best results in the inhouse e-commerce and short-video benchmarks
11
  - Demonstrated comparatively outstanding performance in the OpenCompass (average scores > 67) tests
@@ -39,5 +48,36 @@ pip install -r requirements.txt
39
  All of our open-source models are licensed under the Apache-2.0 license.
40
 
41
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
42
  ## Citation
43
- Coming Soon!
 
 
 
 
 
 
 
 
4
  - Qwen/Qwen2.5-7B-Instruct
5
  ---
6
  # Valley 2.0
7
+
8
+ <p align="center">
9
+ <img src="https://raw.githubusercontent.com/bytedance/Valley/refs/heads/main/assets/valley_logo.jpg" width="500"/>
10
+ <p>
11
+
12
+ <p align="center">
13
+ 🎮️ <a href="https://github.com/bytedance/Valley">Github</a>&nbsp&nbsp | &nbsp&nbsp 🤗 <a href="https://huggingface.co/bytedance-research/Valley-Eagle-7B">Hugging Face</a>&nbsp&nbsp | &nbsp&nbsp🤖 <a href="https://www.modelscope.cn/models/Hyggge/Valley-Eagle-7B">ModelScope</a> &nbsp&nbsp | &nbsp&nbsp 📑 <a href="https://hyggge.github.io/projects/valley/index.html">Home Page</a> &nbsp&nbsp | &nbsp&nbsp 📙 <a href="https://arxiv.org/abs/2501.05901">Paper</a>
14
+ </p>
15
+
16
  ## Introduction
17
+ Valley is a cutting-edge multimodal large model designed to handle a variety of tasks involving text, images, and video data, which is developed by ByteDance. Our model not only
18
 
19
  - Achieved the best results in the inhouse e-commerce and short-video benchmarks
20
  - Demonstrated comparatively outstanding performance in the OpenCompass (average scores > 67) tests
 
48
  All of our open-source models are licensed under the Apache-2.0 license.
49
 
50
 
51
+ ## Related Project
52
+ We list related Project
53
+ - [Valley: Video Assistant with Large Language model Enhanced abilitY](https://github.com/RupertLuo/Valley)
54
+ - [LLaVA: Large Language and Vision Assistant](https://github.com/haotian-liu/LLaVA)
55
+ - [Eagle: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders](https://github.com/NVlabs/EAGLE)
56
+ - [LLaVA-CoT: Let Vision Language Models Reason Step-by-Step](https://github.com/PKU-YuanGroup/LLaVA-CoT)
57
+ - [Qwen2.5](https://github.com/QwenLM/Qwen2.5)
58
+
59
+ ## License Agreement
60
+ All of our open-source models are licensed under the [Apache-2.0](./LICENSE) license.
61
+
62
+ ## We are Hiring
63
+ The Data-Ecommerce-Platform Governance-Basic Algorithms Team focuses on the research and development of multi-modal large model algorithms and foundational algorithms, continuously delving deeply into this field. Our mission is to optimize algorithms and collaborate with business teams to comprehensively govern the quality and ecosystem of ByteDance's e-commerce products. Currently, the team has a strong demand for foundational algorithm expertise in NLP, CV, and multimodal technologies. We welcome inquiries and look forward to working on challenging projects with talented individuals like you!
64
+
65
+ Location: Beijing / Shanghai / Singapore
66
+
67
+ Contact & Resume Submission: wuheng.2024@bytedance.com
68
+
69
+ > Data-电商-平台治理-基础算法团队专注于多模态大模型算法和基础算法的研发,并在此方向上持续深耕。我们的使命是通过算法优化,和业务团队配合,对字节旗下电商产品进行全方位的质量和生态治理。目前团队对NLP/CV/多模态等方向的基础算法技术有强需求,欢迎咨询,期待和优秀的你,一起做有挑战的事情!
70
+ >
71
+ > 岗位城市:北京/上海/新加坡
72
+ >
73
+ > 咨询&简历投递:wuheng.2024@bytedance.com
74
+
75
  ## Citation
76
+ ```
77
+ @article{wu2025valley2,
78
+ title={Valley2: Exploring Multimodal Models with Scalable Vision-Language Design},
79
+ author={Wu, Ziheng and Chen, Zhenghao and Luo, Ruipu and Zhang, Can and Gao, Yuan and He, Zhentao and Wang, Xian and Lin, Haoran and Qiu, Minghui},
80
+ journal={arXiv preprint arXiv:2501.05901},
81
+ year={2025}
82
+ }
83
+ ```