Connect LMM For Classification to other blocks to build a custom workflow
Classify an image into one or more categories using a Large Multimodal Model (LMM).
You can pass arbitrary classes to an LMMBlock.
The LMMBlock supports two LMMs:
- OpenAI's GPT-4 with Vision
- CogVLM
You need to provide your OpenAI API key to use the GPT-4 with Vision model. You do not
need to provide an API key to use CogVLM.
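
The rule above (classes are free-form, and only GPT-4 with Vision needs an API key) can be sketched as a small helper that assembles a step definition. This is an illustrative sketch, not the block's confirmed schema: the field names (`type`, `images`, `lmm_type`, `classes`, `remote_api_key`), the model identifiers `gpt_4v` and `cog_vlm`, the `$inputs.image` selector, and the helper `build_lmm_classification_step` are all assumptions made for this example.

```python
import json
from typing import Optional, Sequence

# Assumed identifiers for the two supported LMMs (not confirmed by this page).
GPT_4V = "gpt_4v"
COG_VLM = "cog_vlm"


def build_lmm_classification_step(
    name: str,
    classes: Sequence[str],
    lmm_type: str,
    api_key: Optional[str] = None,
) -> dict:
    """Build a hypothetical step definition for the LMM classification block."""
    if lmm_type not in (GPT_4V, COG_VLM):
        raise ValueError(f"Unsupported LMM: {lmm_type!r}")
    if not classes:
        raise ValueError("Provide at least one class name")
    # Per the text above, only GPT-4 with Vision requires an OpenAI API key.
    if lmm_type == GPT_4V and not api_key:
        raise ValueError("GPT-4 with Vision requires an OpenAI API key")

    step = {
        "type": "LMMForClassification",  # assumed block type name
        "name": name,
        "images": "$inputs.image",       # assumed input selector
        "lmm_type": lmm_type,
        "classes": list(classes),
    }
    if api_key:
        step["remote_api_key"] = api_key  # assumed field name
    return step


if __name__ == "__main__":
    # CogVLM needs no API key, so only the classes and model are supplied.
    step = build_lmm_classification_step(
        name="classifier",
        classes=["cat", "dog", "other"],
        lmm_type=COG_VLM,
    )
    print(json.dumps(step, indent=2))
```

Validating the key requirement when the step is built, rather than at run time, surfaces a missing OpenAI key before any image is sent to the model.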