toolkit / crawl

Commit History

update base_path in crawl
0e2c9c8

k4d3 committed on

update joy and crawl UwU
2e83eb5

k4d3 committed on

Fix: Correct retry logic to prevent multiple restarts
6c29921

k4d3 committed on

Fix: Move retry_crawl function definition above its usage
9b715b2

k4d3 committed on

dont redefine variables from outer scope + code formatting
a7f9c9a

k4d3 committed on

feat: Add parallel execution for multiple URLs
e4d6e76

k4d3 committed on

retry
c239ce8

k4d3 committed on

update crawl
026c9c4

k4d3 committed on

trailing whitespace yay
d86635a

k4d3 committed on

handle multiple urls
f86ba1d

k4d3 committed on

fix downloading images when the domain is missing
bd31f82

k4d3 committed on

Revert "fix: Add validation for image URLs to prevent invalid downloads"
74e179d

k4d3 committed on

Revert "fix: Enhance URL validation for image downloads
- Improved the URL validation in the download_image function.
- Used requests.utils.urlparse to parse the URL and check for a valid scheme and netloc.
- Ensured the URL path does not contain invalid labels (e.g., ..).
- This prevents the script from attempting to download images with invalid URLs and provides clearer error messages."
0cb8585

k4d3 committed on

Revert "fix: Improve URL validation to catch invalid labels in image downloads"
e4302bc

k4d3 committed on

fix: Improve URL validation to catch invalid labels in image downloads
fc02b0f

k4d3 committed on

fix: Enhance URL validation for image downloads
- Improved the URL validation in the download_image function.
- Used requests.utils.urlparse to parse the URL and check for a valid scheme and netloc.
- Ensured the URL path does not contain invalid labels (e.g., ..).
- This prevents the script from attempting to download images with invalid URLs and provides clearer error messages.
8139183

k4d3 committed on
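
For readers without the diff at hand, the checks described in commit 8139183 might look roughly like the sketch below. This is an illustration only, not the code from the commit (which the log shows was later reverted): the function name is_valid_image_url is hypothetical, the restriction to http and https is an assumption, and the standard-library urllib.parse.urlparse stands in for the requests.utils.urlparse call mentioned in the message.

```python
from urllib.parse import urlparse


def is_valid_image_url(url: str) -> bool:
    """Hypothetical helper: decide whether a URL looks safe to download.

    Mirrors the three checks listed in the commit message; the real
    download_image function may have been structured differently.
    """
    parsed = urlparse(url)

    # Check 1: a usable scheme (assumed here to mean http or https).
    if parsed.scheme not in ("http", "https"):
        return False

    # Check 2: a network location, i.e. the domain must be present.
    if not parsed.netloc:
        return False

    # Check 3: no ".." segment anywhere in the path.
    if ".." in parsed.path.split("/"):
        return False

    return True
```

A caller could run this before each download and log a clear message for any URL it rejects, which is the behaviour the commit message describes.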

fix: Add validation for image URLs to prevent invalid downloads
852b094

k4d3 committed on

Revert "Improve error handling in save_result function"
1004c51

k4d3 committed on

Improve error handling in save_result function
06e576c

k4d3 committed on

Initial commit
f1a2ec8

k4d3 committed on