--- license: apache-2.0 tags: - Computer - computervision --- # Uses This LLM is trained on data generated by my code for the yolov8 model. [Github code](https://github.com/bauerhartmut/yolov8-Computervision) The model is capable of briefly describing what the yolov8 model can detect and can also execute a command (/click). When the command is triggered, a dictionary is generated containing the key data of the object to be clicked. # Testing You can test the model by giving it this informations: ```json { "Object": [ { "index": "window_0", "label": "window", "property": "toplayer", "coords": [ 189.06007385253906, 79.33326721191406, 1156.018798828125, 750.1478271484375 ], "textes": 24, "interactions": [ { "label": "close_window", "interaction_type": 1, "coords": [ 1114.04541015625, 84.65348815917969, 1149.1778564453125, 113.41248321533203 ] }, { "label": "maximize", "interaction_type": 1, "coords": [ 1067.0111083984375, 84.82215118408203, 1099.86328125, 112.69491577148438 ] }, { "label": "minize_window", "interaction_type": 1, "coords": [ 1024.7701416015625, 85.06327819824219, 1053.4327392578125, 111.52396392822266 ] } ] } ] } ``` You can give the model this informations and a prompt like "Was siehst du" or "Kannst du das Fenster schließen". The Model is at the moment only trained on german.