Today, virtually every cutting-edge AI product and model uses a transformer architecture. Large language models (LLMs) such as GPT-4o, LLaMA, Gemini and Claude are all transformer-based, and other AI ...
This sets unrealistic expectations for AI and leads to misuse. It also slows progress toward building new AI applications.
OpenAI rival AI21 Labs Ltd. today lifted the lid on its latest competitor to ChatGPT, unveiling the open-source large language models Jamba 1.5 Mini and Jamba 1.5 Large. The new models are based ...
Boasting over 17 billion parameters and 78 transformer layers, Microsoft's new Turing Natural Language Generation model outperforms many currently available state-of-the-art models. Transformer-based ...
The object detection required for machine vision applications such as autonomous driving, smart manufacturing, and surveillance depends on AI modeling. The goal now is to improve the ...
This release is well suited to developers building long-context applications or real-time reasoning agents, and to those seeking to ...
The self-attention-based transformer model was first introduced by Vaswani et al. in their 2017 paper "Attention Is All You Need" and has since been widely used in natural language processing. A ...
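As a rough illustration of the self-attention operation at the core of that architecture, the following is a minimal NumPy sketch of scaled dot-product attention, softmax(QK^T / sqrt(d_k)) V, as described in the paper. The function name and toy shapes are illustrative only, not drawn from any reference implementation.

    import numpy as np

    def scaled_dot_product_attention(Q, K, V):
        # Q, K: (seq_len, d_k); V: (seq_len, d_v).
        # Scores measure how strongly each query position attends to each key.
        d_k = Q.shape[-1]
        scores = Q @ K.T / np.sqrt(d_k)
        # Softmax over the key dimension turns scores into attention weights.
        weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
        weights /= weights.sum(axis=-1, keepdims=True)
        # Each output row is a weighted average of the value vectors.
        return weights @ V

    # Toy usage: 4 tokens with 8-dimensional embeddings attending to themselves.
    x = np.random.randn(4, 8)
    out = scaled_dot_product_attention(x, x, x)
    print(out.shape)  # (4, 8)

In the full transformer, this operation is applied across multiple heads in parallel, with learned projections of the inputs producing the query, key, and value matrices.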