AI News Update: OpenAI Spring Event and Google IO Announcements

May 18, 2024

AI News Update: OpenAI Spring Event and Google IO Announcements 🎉

OpenAI Spring Event Highlights

  • OpenAI Spring Update
    • Released on Monday.
    • Full breakdown video available titled "Chat GPT's Amazing New Models Feel Human and it’s Free."

GPT-40 Announcement

  • GPT-40 (Omni)
    • Multimodal model handling text, audio, video, and images.
    • Faster, better model compared to the previous GPT-4 model.
    • Availability:
      • Free for all ChatGPT users with advanced features available (vision models, internet browsing, memory, etc.).
      • ChatGPT Plus Members: Early access to features with five times more outputs.

Developer Features

  • API Features
    • GPT-40 API is twice as fast, 50% cheaper, and has a five times higher rate limit.

Conversational & Voice Capabilities

  • Her-like Features
    • Demo showcasing conversational abilities resembling Scarlet Johansson’s voice from the movie "Her."
    • Math problem-solving demo using image recognition and showing step-by-step solutions.
    • New desktop app with advanced functionalities (e.g., accessing screen interactions).
    • Ability to change speaking emotions and voices during interaction.
    • Recognizes facial expressions to understand user emotions.

Image Generation & Other Capabilities

  • Image Generation: E.g., generating legible text in images (whiteboard).
  • Character Consistency: Ability to use same generated character across different scenarios.
  • Miscellaneous Features: Photo to caricature, text-to-font, 3D object synthesis, brand placement.

Ilia Sutskever's Departure from OpenAI

  • Ilia Sutskever Leaving OpenAI
    • Co-founder, pivotal in Sam Altman’s firing and return had stepped down.
    • Speculation: Discontent with commercialization direction of OpenAI.
    • Left on good terms; planning a meaningful new project.
  • Resignations Following Ilia
    • Key members of the super alignment team also resigned.

Google IO Highlights

Gemini 1.5 Release

  • Gemini 1.5
    • Two versions: Pro (more intelligent) and Flash (faster).
    • Both versions have a 1 million token context window with plans for 2 million.

Project Astra

  • Project Astra Demo
    • Mobile phone identifying objects and remembering visual information.
    • Functions similar to a heads-up display; evident in the glasses shown in demo.

Notebook LM and Image Generation

  • Notebook LM: Organizes and explains user’s data in a podcast-style interaction.
  • Imagine 3: New text-to-image model with enhanced realism, approaching mid-journey level.
  • Veo Video Generation Model: Generating 1080P+ minute-long videos, available soon.

AI Integrations in Google Suite

  • Google Search: Enhanced with multi-step reasoning and context-based responses.
  • Gmail and Google Meet: AI features for summarizing and organizing email content.
  • Ask Photos: Context-aware responses based on photo content (e.g., identifying swimming lessons).
  • Scam Detection in Phone Calls: Detects and warns users about possible fraud.

Additional News

Meta Exploring AI Earbuds

  • AI-assisted Earphones: Meta working on concept similar to AR glasses but in earbud form.

Microsoft Build Event Upcoming

  • Expectations
    • Likely announcements about integration of GPT-40 across Microsoft products.
    • Co-pilot features enhanced with voice and GPT-40 functionalities.

Other Companies

  • Anthropic: Hired Instagram co-founder Mike Krieger as Chief Product Officer.
  • Hume: Released "Chatter," an interactive podcast experience.

Closing Notes

  • Event Season: Multiple upcoming AI events including Microsoft Build, Cisco, Qualcomm, and Apple WWDC.
  • Newsletter and Giveaway: Subscribe to stay updated on AI news and win tech gadgets.
  • Thank You: Appreciating viewers and subscribers for their engagement.

Stay tuned for more updates from the AI world! 🤖