Model Card for Model ID

2024.4.4 Update

This model is a sentiment analysis model designed to determine the positive/neutral/negative sentiment of sentences included in corporate-related news.

This model is targeted to provide sentiment for "important news", as described in the paper mentioned following. So, the results may not be accurate for less important news.

It can be used as a Korean-based sentiment analysis model for the finance/management/accounting fields.

Example>

"Samsung's debt is increasing." --> Neutral. The mere increase in debt is not necessarily negative.

"Due to the failure of management strategy, Samsung's debt is increasing." --> Negative. Debt increase due to failure is considered negative.

Hyun Ji-won, Lee Jun-il, and Cho Hyun-kwon. "A Study on Sentiment Classification of Corporate-related News Articles Using KoBERT." Accounting Research 47.4 (2022): 33-54.

We have further developed the model proposed in the above paper and made it available through Huggingface. If you use it for research purposes, please cite the above paper.

This model was fine-tuned using https://huggingface.co/jhgan/ko-sroberta-multitask.

For the usage code, refer to the link below:

Google Colab: https://colab.research.google.com/drive/1ORzKUr94cPyc5jaRCAngbclm4Qb4DtdG

The current evaluation results of the model are as follows:

{'eval_loss': 0.7330707907676697, 'eval_f1': 0.8689251403360293, 'eval_runtime': 0.464, 'eval_samples_per_second': 2047.32, 'eval_steps_per_second': 17.241, 'epoch': 33.33}

While the accuracy has increased compared to the paper's 85.7%, the improvement is not significant.

2024.4.4 Update

이 모형은 기업관련 뉴스에 포함된 문장의 긍정/중립/부정을 판단하기 위한 감성분석 모형입니다.

이 모형은 하단 논문에서 설명한 바와 같이 중요한 뉴스의 감성분석결과를 제공하도록 훈련되었으므로, 중요성이 낮은 뉴스에 대한 감성분석 결과는 정확하지 않을 수 있습니다.

한국어 기반 금융/경영/회계 분야 감성분석 모형으로 사용하시면 됩니다.

예시>

삼성전자의 부채가 증가하고 있습니다. --> 중립 (neutral). 부채증가 자체는 부정적이라고 보기 어려움

경영전략의 실패로 삼성전자의 부채가 증가하고 있습니다. --> 부정 (negative). 실패로 인한 부채 증가는 부정적

현지원, 이준일, and 조현권. "KoBERT 를 이용한 기업관련 신문기사 감성 분류 연구." 회계학연구 47.4 (2022): 33-54.

위 논문에서 제안한 모델을 발전시켜 huggingface를 통해 공개합니다. 연구에 사용하실 경우 위 페이퍼를 cite 해 주시기 바랍니다.

해당 모델은 https://huggingface.co/jhgan/ko-sroberta-multitask 를 사용하여 finetuing 한 모형입니다.

사용 코드는 아래 링크를 참고하셔요

구글 코랩: https://colab.research.google.com/drive/1ORzKUr94cPyc5jaRCAngbclm4Qb4DtdG

현재 모형의 evaluation 결과는 다음과 같습니다.

{'eval_loss': 0.7330707907676697, 'eval_f1': 0.8689251403360293, 'eval_runtime': 0.464, 'eval_samples_per_second': 2047.32, 'eval_steps_per_second': 17.241, 'epoch': 33.33}

정확도 기준으로 논문의 85.7% 에 비해 상승하였으나, 상승폭이 현저하지는 않습니다.

kwoncho
/

KoFinBERT

Model Card for Model ID

Model Details

Model Description