Vision Large Language Model

Z.ai debuts open source GLM-4.6V, a native tool-calling vision model for multimodal reasoning

Chinese AI startup Zhipu AI aka Z.ai has released its GLM-4.6V series, a new generation of open-source vision-language models (VLMs) optimized for multimodal reasoning, frontend automation, and ...

VentureBeat

Microsoft drops Florence-2, a unified model to handle a variety of vision tasks

Join the event trusted by enterprise leaders for nearly two decades. VB Transform brings together the people building real enterprise AI strategy. Learn more Today, Microsoft’s Az u re AI team dropped ...

Fastest AI Vision Model for Your Laptop : Liquid AI LFM 2.5

Liquid AI’s LFM 2.5 runs a vision-language model locally in your browser via WebGPU and ONNX Runtime, working offline once ...

techtimes

Google Joins the Vision-Language Model with PaliGemma 2, But How Will It Help its AI Charge?

There are different types of AI models available in the market for users to choose from, and it will largely depend on the type of service they need from the machine learning technology, and Google ...

14d

DeepRoute.ai Presents 40B Vision-Language-Action Foundation Model at NVIDIA GTC 2026, Accelerating Autonomous Driving at Scale

At NVIDIA GTC 2026, DeepRoute.ai presented a comprehensive introduction to its 40-billion-parameter Vision-Language-Action (VLA) Foundation Model architecture, representing a fundamental breakthrough ...

Forbes

Show inaccessible results

Z.ai debuts open source GLM-4.6V, a native tool-calling vision model for multimodal reasoning

Microsoft drops Florence-2, a unified model to handle a variety of vision tasks

Fastest AI Vision Model for Your Laptop : Liquid AI LFM 2.5

Google Joins the Vision-Language Model with PaliGemma 2, But How Will It Help its AI Charge?

DeepRoute.ai Presents 40B Vision-Language-Action Foundation Model at NVIDIA GTC 2026, Accelerating Autonomous Driving at Scale

How Vision Language Models Will Shape The Future Of Self-Driving Cars

Study shows vision-language models can't handle queries with negation words

Vision Language Models Come Rushing In

What is Llama? Meta AI’s family of large language models explained