Update README.md
README.md
CHANGED
@@ -1,17 +1,3 @@
-
-license: mit
----
+py-retriever
 
-
-
-This is the retrieval model for [ReACC: A Retrieval-Augmented Code Completion Framework](https://arxiv.org/abs/2203.07722).
-
-In the paper, the model retrieves similar code given an incomplete code snippet as the query. It can also be used for incomplete code-to-code search and code clone detection.
-
-`py-retriever` is a BERT-like encoder consisting of 12 transformer layers. It is continually pre-trained from [GraphCodeBERT](https://huggingface.co/microsoft/graphcodebert-base) with contrastive learning on Python code. More details can be found in our paper.
-
-Note that the input format differs from the original source code. We normalize the source code to better capture information from line breaks and indentation in Python. An example input is:
-```python
-sum = 0<endofline>for val in numbers:<endofline><INDENT>sum = sum+val
-```
-To learn how to convert source code into this format, please refer to the [ReACC GitHub repo](https://github.com/microsoft/ReACC).
+Python code retriever
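
The removed README text describes a normalized input format in which line breaks become `<endofline>` and an increase in indentation becomes `<INDENT>`. The following is a minimal sketch of such a normalization, not the conversion script from the ReACC repository. The function name `normalize_python` is hypothetical, and the `<DEDENT>` marker and the dropping of blank lines are assumptions, since the README example shows only `<endofline>` and `<INDENT>`.

```python
def normalize_python(source: str) -> str:
    """Flatten Python source into a single line with explicit layout markers."""
    indent_stack = [0]  # widths of the currently open indentation levels
    pieces = []
    for raw in source.expandtabs(4).splitlines():
        stripped = raw.strip()
        if not stripped:
            continue  # assumption: blank lines are dropped
        width = len(raw) - len(raw.lstrip())
        markers = ""
        if width > indent_stack[-1]:
            # Deeper indentation opens a new block.
            indent_stack.append(width)
            markers = "<INDENT>"
        else:
            # Shallower indentation closes one or more blocks.
            while width < indent_stack[-1]:
                indent_stack.pop()
                markers += "<DEDENT>"  # assumption: not shown in the README example
        pieces.append(markers + stripped)
    return "<endofline>".join(pieces)


snippet = "sum = 0\nfor val in numbers:\n    sum = sum+val\n"
print(normalize_python(snippet))
```

For the three-line loop above, the sketch reproduces the example string from the README. It works line by line, so it does not handle multi-line strings or continuation lines; the ReACC repository remains the reference for the format the model was trained on.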