Update README.md
README.md
CHANGED
@@ -15,10 +15,10 @@ For more details about SwiftKV and how to use it:
 
 ## Performance Metrics
 
-To evaluate SwiftKV’s performance, we focus on the following key metrics:
+To evaluate SwiftKV’s performance, we focus on the following key metrics (see more details in our [blog](https://www.snowflake.com/engineering-blog/swiftkv-llm-compute-reduction/)):
 * Combined throughput: The total number of input and output tokens processed per second. This determines:
-For batch processing, the time required to complete jobs.
-For interactive use, the volume of concurrent requests a system can handle.
+* For batch processing, the time required to complete jobs.
+* For interactive use, the volume of concurrent requests a system can handle.
 * TTFT: The latency between a user request and receiving the first token in the response.
 * TPOT: The latency between subsequent tokens after the first token.
 
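For readers of this change, the three metrics named in the README text can be illustrated with a minimal Python sketch. It is not taken from SwiftKV or its benchmarks: the `RequestTrace` record, its field names, and the helper functions are hypothetical, and it assumes each request logs its send time plus the arrival time of every output token.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class RequestTrace:
    """Hypothetical per-request timing record (not part of SwiftKV)."""
    start_time: float         # time the request was sent, in seconds
    token_times: List[float]  # arrival time of each output token, in seconds
    num_input_tokens: int     # number of prompt tokens


def ttft(trace: RequestTrace) -> float:
    """Time to first token: delay from the request to the first output token."""
    return trace.token_times[0] - trace.start_time


def tpot(trace: RequestTrace) -> float:
    """Time per output token: mean gap between tokens after the first one."""
    gaps = len(trace.token_times) - 1
    if gaps < 1:
        return 0.0  # a single-token response has no inter-token gap
    return (trace.token_times[-1] - trace.token_times[0]) / gaps


def combined_throughput(traces: List[RequestTrace]) -> float:
    """Input plus output tokens processed per second of wall-clock time."""
    total_tokens = sum(t.num_input_tokens + len(t.token_times) for t in traces)
    wall_start = min(t.start_time for t in traces)
    wall_end = max(t.token_times[-1] for t in traces)
    return total_tokens / (wall_end - wall_start)


if __name__ == "__main__":
    trace = RequestTrace(start_time=0.0, token_times=[0.5, 0.6, 0.7], num_input_tokens=100)
    print(f"TTFT: {ttft(trace):.3f} s")
    print(f"TPOT: {tpot(trace):.3f} s")
    print(f"Combined throughput: {combined_throughput([trace]):.1f} tokens/s")
```

Measuring throughput over the shared wall-clock window, rather than summing per-request rates, is one reasonable choice here because it reflects concurrent requests overlapping in time; a real benchmark harness may define the window differently.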