research / documents /docs /2-总结功能.md
haoqi7's picture
Upload 47 files
16188ba

A newer version of the Streamlit SDK is available: 1.39.0

Upgrade

2 Research Trends Summarization

Model Architecture

1 Baseline Configuration

  1. pre-trained language model: sentence-transformers/all-MiniLM-L6-v2
  2. dimension reduction: None
  3. clustering algorithms: kmeans
  4. keywords extraction model: keyphrase-transformer

[example run] [results]

TODO:

  1. clustering: using other clustering algorithms such as Gausian Mixture Model (GMM)
  2. keywords extraction model: train another model
  3. add dimension reduction
  4. better PLM: sentence-transformers/sentence-t5-xxl