🎉 TwelveLabs Raises $100M Series B to build the future of video superintelligence. Read more.

Platform

Pricing

Solutions

Build

Resources

Company

Select Language

Playground

Talk to Sales

🎉 TwelveLabs Raises $100M Series B to build the future of video superintelligence. Read more.

Modeling the world.   Remodeling video.

TwelveLabs models can see and reason about video like no other AI – and they set the standard for a new era of video data interaction.

Marengo 3.0

Our breakthrough video foundation model analyzes frames and their temporal relationships, along with speech and sound — a huge leap forward for search and any-to-any retrieval tasks.

Learn more

Pegasus 1.5

Our powerful video-first language model integrates visual, audio, and speech information — and employs this deep video understanding to reach new heights in text generation.

Learn more

Models

Marengo transforms text, audio, image, and video into numerical representations called embeddings.

Embeddings enable unsurpassed information retrieval. Now, you can perform powerful cross-modal searches – across text, audio, image and video.

This any-to-any retrieval can transform applications. Content discovery, recommendation systems, description and analysis will all change for good.

At TwelveLabs, we’re developing video-native AI systems that can solve problems with human-level reasoning. Helping machines learn about the world — and enabling humans to retrieve, capture, and tell their visual stories better.

Our Research

Marengo

3.0

Powered Features

Where power meets potential.

Introducing ‘any-to-any’ search

Marengo’s state-of-the-art ‘any-to-any’ search helps you pinpoint exact moments in vast video libraries, or allow customers to find any video moment within your platform.

Learn more

Embed

Introducing ‘rich embeddings’

With Marengo, it’s easy to build complex features like semantic search, hybrid search, anomaly detection, and more.

Learn more

Marengo is moving   at lightning speed.

What’s new

Tutorials

Marengo’s latest breakthroughs.

Semantic Content Discovery for a Post-Production World

Pegasus understands video and generates accurate descriptions and analysis.

A powerful intelligence interface, Pegasus can answer questions, generate creative outputs, and provide detailed analysis of any video.  

Simply describe what you need in natural language. Get marketing suggestions, high-impact captions, or even a child-friendly summary of a video instantly.

Our Research

Pegasus

1.5

Powered Features

Where words and moving image unite.

Analyze

Generate understanding with Pegasus

With video-to-text generation, Pegasus redefines how humans interact with video data. Intuitive, versatile, and powerful – this is human-level reasoning, at AI scale.

Learn more