phi-3.5 rpp results
- logs/l40-1gpu-5.txt +0 -0
- logs/l40-4gpu-8.txt +9 -0
- results/mac-results_rpp_with_mnt_2048.csv +0 -0
logs/l40-1gpu-5.txt
ADDED
The diff for this file is too large to render.
logs/l40-4gpu-8.txt
CHANGED
@@ -175,3 +175,12 @@ You seem to be using the pipelines sequentially on GPU. In order to maximize eff
 The `seen_tokens` attribute is deprecated and will be removed in v4.41. Use the `cache_position` model input instead.
 2024-08-26 14:30:49,269 [WARNING] [logging.py:328] You are not running the flash-attention implementation, expect numerical differences.
 You seem to be using the pipelines sequentially on GPU. In order to maximize efficiency please use a dataset
+[nltk_data] Downloading package punkt to
+[nltk_data] /common/home/users/d/dh.huang.2023/nltk_data...
+[nltk_data] Package punkt is already up-to-date!
+2024-08-26 19:44:35,757 [WARNING] [modeling_phi3.py:62] `flash-attention` package not found, consider installing for better performance: No module named 'flash_attn'.
+2024-08-26 19:44:35,757 [WARNING] [modeling_phi3.py:66] Current `flash-attention` does not support `window_size`. Either upgrade or use `attn_implementation='eager'`.
+
+The `seen_tokens` attribute is deprecated and will be removed in v4.41. Use the `cache_position` model input instead.
+2024-08-26 19:45:11,738 [WARNING] [logging.py:328] You are not running the flash-attention implementation, expect numerical differences.
+You seem to be using the pipelines sequentially on GPU. In order to maximize efficiency please use a dataset
results/mac-results_rpp_with_mnt_2048.csv
CHANGED
The diff for this file is too large to render.