🎤

Gladia's Real-Time Audio Transcription Breakthroughs

Apr 9, 2025

Gladia and Real-Time Audio Transcription APIs

Overview

  • Company: Gladia, a French startup
  • Focus: Real-time processing in audio transcription APIs
  • Funding: Raised $16 million in a Series A funding round
  • Competitors: Amazon, Microsoft, Google, AssemblyAI, Deepgram, Speechmatics

Gladia's Offerings

  • Provides a speech-recognition API
  • Converts audio files to text with high accuracy and low turnaround time
  • Supports 100 languages and various accents
  • Includes diarization to distinguish between multiple speakers

Use and Adoption

  • Used by over 600 companies
  • Integrated into applications like meeting recorders and note-taking assistants (e.g., Attention, Circleback)
  • Helps in generating knowledge from text through LLMs like GPT-4o and Claude 3.5 Sonnet

Real-Time Processing

  • Current Challenge: Improve real-time transcription quality
  • Solution: Aim for batch quality in real-time processing
  • Performance: Achieves transcription with latency under 300 milliseconds
  • Use Cases: AI calling agents, call centers for real-time information retrieval

Gladia's Vision

  • Integration of audio intelligence and LLM tasks in a single API call
  • Simplifying the process of speech-to-text conversion and knowledge extraction without third-party LLMs

Future Prospects

  • Anticipates a "ChatGPT moment" for audio applications
  • Expected increase in consumer awareness as transcription models are included in mobile OS
  • Potential for developers to enhance products with audio features using Gladia's APIs

Key Players Involved

  • Founders: Jean-Louis Quguiner (CEO), Jonathan Soto (CTO)
  • Lead Investor: XAnge
  • Other investors: Illuminate Financial, XTX Ventures, Athletico Ventures, etc.

Industry Impact

  • Gladia positions itself ahead by focusing on real-time transcription capabilities
  • Aspires to match or exceed the quality of established APIs by leveraging advanced technologies

Strategic Goals

  • Enhance the quality of real-time processing
  • Maintain compatibility with existing tech stacks and protocols (SIP, VoIP, FreeSwitch, Asterisk)
  • Broaden the application of their API across various sectors, enhancing automation and efficiency