Elon Musk’s xAI previews Grok-1.5V, its first multimodal model

Elon Musk’s xAI has introduced its first multimodal model. In addition to understanding text, it can process visual information in documents, diagrams, charts, screenshots and photographs. Grok-1.5 Vision, or Grok-1.5V, will be available soon to early testers and existing Grok users.

1 Answer

A minimal Docker image based on Alpine Linux with a complete package index and only 5 MB in size!