tsunghanwu
/

SESAME_minus

Text Generation

Inference Endpoints

Model card Files Files and versions Community

tsunghanwu commited on Jun 16, 2024

Commit

5588b4a

·

verified ·

1 Parent(s): 2f30d53

Update README.md

Files changed (1) hide show

README.md +13 -3

README.md CHANGED Viewed

@@ -1,3 +1,13 @@
----
-license: mit
----

+---
+license: mit
+---
+## SESAME_minus
+- Model type: SESAME_minus is an open-source multimodal model trained by fine-tuning LLaVA on various instruction-based image grounding (segmentation) data. It is an instruction-baed segmentation model basically, serving as a baseline.
+- Paper or resources for more information: https://see-say-segment.github.io/
+- Where to send questions or comments about the model: https://github.com/see-say-segment/sesame/issues
+- Intended use
+  - Primary intended uses: The primary use of SESAME is research on large multimodal models and chatbots.
+  - Primary intended users: The primary intended users of the model are researchers and hobbyists in computer vision, natural language processing, machine learning, and artificial intelligence.
+- Training dataset: RefCOCO(+/g)