<!DOCTYPE html>
<html>
<body>
<h1>Relevant Work</h1>
The tool is developed by <a href="https://wnstlr.github.io" target="_blank">Joon Sik Kim</a>, based on the findings from our paper <a href="https://arxiv.org/abs/2302.08450" target="_blank">Assisting Human Decisions in Document Matching</a>, with <a href="https://valeriechen.github.io/" target="_blank">Valerie Chen</a>, <a href="https://danishpruthi.com/" target="_blank">Danish Pruthi</a>, <a href="https://www.cs.cmu.edu/~nihars/" target="_blank">Nihar B. Shah</a>, and <a href="https://www.cs.cmu.edu/~atalwalk/" target="_blank">Ameet Talwalkar</a>.
<h1>What Happens Behind the Scenes</h1>
<ul>
<li> The tool retrieves the reviewer's previous publications using the <a href="https://www.semanticscholar.org/product/api" target="_blank">Semantic Scholar API</a>.</li>
<li> The tool computes an affinity score between the submission abstract and each paper's abstract, using text representations from a <a href="https://huggingface.co/allenai/specter2" target="_blank">language model fine-tuned on academic papers</a>.</li>
<li> The tool then computes pairwise sentence relevance scores between the submission abstract and the reviewer paper's abstract, using text representations from <a href="https://huggingface.co/allenai/specter2" target="_blank">the same model</a>.</li>
<li> The tool highlights overlapping words (nouns) between sentence pairs using <a href="https://www.nltk.org/book/ch05.html" target="_blank">POS tagging</a>.</li>
</ul>
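<p>The scoring steps above can be sketched roughly as follows. This is a minimal illustration, not the tool's actual implementation: <code>embed</code> is a hypothetical bag-of-words stand-in for the language model's text representations, cosine similarity is an assumed scoring function (the steps above do not name one), the sentence splitter is deliberately naive, and the abstracts are made-up examples.</p>
<pre>
```python
import math
import re
from collections import Counter


def embed(text):
    """Hypothetical stand-in embedding: bag-of-words counts (NOT SPECTER2)."""
    return Counter(re.findall(r"[a-z]+", text.lower()))


def cosine(u, v):
    """Cosine similarity between two count vectors (an assumed metric)."""
    # Counter returns 0 for missing words, so unshared words add nothing.
    dot = sum(count * v[word] for word, count in u.items())
    norm_u = math.sqrt(sum(c * c for c in u.values()))
    norm_v = math.sqrt(sum(c * c for c in v.values()))
    norm = norm_u * norm_v
    return dot / norm if norm else 0.0


def split_sentences(abstract):
    """Naive splitter: break on '.', '!' or '?' and drop empty pieces."""
    return [s.strip() for s in re.findall(r"[^.!?]+", abstract) if s.strip()]


def sentence_relevance(submission, paper):
    """Score every (submission sentence, paper sentence) pair."""
    sub_sents = split_sentences(submission)
    pap_sents = split_sentences(paper)
    scores = [[cosine(embed(a), embed(b)) for b in pap_sents] for a in sub_sents]
    return sub_sents, pap_sents, scores


# Made-up abstracts, for illustration only.
submission = "We study graph neural networks. Our method improves node classification."
papers = {
    "Paper A": "Graph neural networks learn node representations. "
               "We evaluate on node classification.",
    "Paper B": "We analyze protein folding dynamics. Simulations run on large clusters.",
}

# Document-level affinity: one score per reviewer paper, highest first.
affinity = {title: cosine(embed(submission), embed(text)) for title, text in papers.items()}
ranked = sorted(affinity, key=affinity.get, reverse=True)
print(ranked)

# Sentence-level relevance for the top-ranked paper.
sub_sents, pap_sents, scores = sentence_relevance(submission, papers[ranked[0]])
for sentence, row in zip(sub_sents, scores):
    print(sentence, [round(value, 2) for value in row])
```
</pre>
<p>In a real setup, <code>embed</code> would return a dense vector from the language model and <code>cosine</code> would operate on those vectors; the ranking and pairwise-scoring structure would stay the same.</p>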
<h1>Privacy Policy</h1>
The tool does not compromise the privacy of the reviewers. It relies only on their publicly available information (e.g., names and lists of previously published papers). All input information is used only temporarily for internal computation, is not saved externally, and is deleted when the session is refreshed or closed. No search history is kept.
<h1>Disclaimer</h1>
This tool and its output should not serve as the sole justification for confirming a match for the submission. It is intended as a <b>supplementary</b> tool that users may use at their discretion. The correctness of the tool's output is not guaranteed. The search results may be improved by updating the internal models used to compute the affinity scores and sentence relevance, which may require additional independent research and experimentation.
</body>
</html>