---
tags:
- panda70m
- video2text
---

# How to use

```bash
huggingface-cli download Ligeng-Zhu/panda70m \
  --local-dir panda70m --repo-type dataset --local-dir-use-symlinks False
```

Then install the dependencies:

```bash
pip install fire yt_dlp pandas
```

Next, download the videos:

```bash
python main.py --csv=<your csv files>
```

Or split the work into shards to speed up downloading. Run one command per shard, with `--shards` going from 0 to `--total` minus 1:

```bash
python main.py --csv=<your csv files> --shards=0 --total=10
python main.py --csv=<your csv files> --shards=1 --total=10
...
python main.py --csv=<your csv files> --shards=9 --total=10
```
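
To show what sharding does, here is a minimal Python sketch of one way a list of rows can be split so that each shard handles a disjoint subset. This README does not show `main.py`, so the function name `shard_rows` and the strided split are assumptions for illustration only; the actual script may partition the CSV differently (for example, in contiguous blocks).

```python
def shard_rows(rows, shard, total):
    """Return the rows assigned to shard number `shard` out of `total` shards.

    Hypothetical helper: uses a strided split, so shard k takes rows
    k, k + total, k + 2 * total, and so on.
    """
    if total <= 0:
        raise ValueError("total must be a positive integer")
    if not 0 <= shard < total:
        raise ValueError("shard must satisfy 0 <= shard < total")
    return rows[shard::total]


if __name__ == "__main__":
    # Stand-in for the rows of a CSV file (made-up identifiers).
    video_ids = [f"video_{i}" for i in range(25)]
    total = 10

    parts = [shard_rows(video_ids, s, total) for s in range(total)]

    # Every row is handled by exactly one shard.
    merged = [row for part in parts for row in part]
    assert sorted(merged) == sorted(video_ids)

    for s, part in enumerate(parts):
        print(f"shard {s}: {len(part)} rows")
```

With a split like this the shards do not overlap, so each one can run in a separate terminal or on a separate machine, and together they cover the whole CSV.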