[CVPR 2024 ๐ฅ] Grounding Large Multimodal Model (GLaMM), the first-of-its-kind model capable of generating natural language responses that are seamlessly integrated with object segmentation masks.
groundingLMM is a standout agent — gaining momentum with 949 stars, a solid trust score of 60.6/100, and native support for REST.
Analyze, generate, and transform visuals
| Type | Agent |
| Language | Python |
| Trust Score | 60.6/100 (High) |
| Stars | ★ 949 |
| Categories | General |
| Protocols | REST |
| Source | https://github.com/mbzuai-oryx/groundingLMM |
Add a trust badge to your README:
[](https://fushu.dev/agent/2e39b81072de)
click to copy
Install now and integrate into your workflow in minutes.