Video Question Answering

Gradio Demo for the MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming. This model can answer questions about videos in natural language. To use it, upload your video, type a question, select associated parameters, use the default values, click 'Submit', or click one of the examples to load them. You can read more at the links below.

0.01 1.99
0 1
0 1000
1 4096
Examples
Video Question Temperature Top P Top K Max Tokens