Improve performance with contextual compression, a technique where retrieved documents are compressed, and irrelevant information is filtered out. 9caad80 vinhnx90 commited on Apr 2
Disable tokenizer transformer parallelism to avoid deadlocks 91d4c2f unverified Vinh Nguyen commited on Apr 2