MythicalCow commited on
Commit
3abe895
·
2 Parent(s): 749a83f 32015fa

readme update

Browse files
Files changed (1) hide show
  1. README.md +20 -1
README.md CHANGED
@@ -93,6 +93,21 @@ widget:
93
 
94
  '
95
  ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
96
 
97
  # SentenceTransformer based on Snowflake/snowflake-arctic-embed-l
98
 
@@ -3614,4 +3629,8 @@ You can finetune this model on your own dataset.
3614
  ## Model Card Contact
3615
 
3616
  *Provides a way for people who have updates to the Model Card, suggestions, or questions, to contact the Model Card authors.*
3617
- -->
 
 
 
 
 
93
 
94
  '
95
  ---
96
+ # Github and Technical Report
97
+ To view our technical report and for access to other model and dataset related scripts visit our github [Github](https://github.com/khoj-ai/timely/tree/main)
98
+
99
+ # Timely: An Embeddings Model For Temporal Reasoning
100
+
101
+ At Khoj, we develop open-source personal AI to simplify how people engage with machines. The RAG component in modern AI systems commonly uses an embedding model to retrieve relevant documents for a user query. This retrieved-context enables accurate and personalized responses.
102
+
103
+ However, most of these models struggle with temporal reasoning. For instance, if asked "Where was I last summer?", the model would struggle to understand the framing of that question. It requires us to understand the relativity of time (that 2010 is before 2011), and when summer might be (between May - September).
104
+
105
+ When we express dates, we often use shorthands like ‘back in June’, ‘on summer break’, and ‘06/15’; all syntaxes that models don’t presently handle well. As such, your embedding model may not find documents with dates within that specific period. This limitation is significant, given the importance of time and date in language and daily life.
106
+
107
+ To address this problem, we propose **Timely**, a comprehensive pipeline for date-aware dataset generation, model fine-tuning, and benchmarking. Specifically, our goal is to create models that can:
108
+
109
+ 1. Identify natural language dates in queries and documents better
110
+ 2. Can handle relative and soft data filters more naturally (e.g. discerning that June is closer than November when talking about Spring).
111
 
112
  # SentenceTransformer based on Snowflake/snowflake-arctic-embed-l
113
 
 
3629
  ## Model Card Contact
3630
 
3631
  *Provides a way for people who have updates to the Model Card, suggestions, or questions, to contact the Model Card authors.*
3632
+ -->
3633
+ - loss:MultipleNegativesRankingLoss
3634
+ datasets:
3635
+ - sentence-transformers/wikihow
3636
+ ---