🖼️

New Image Generation Features in ChatGPT

Apr 9, 2025

Lecture Notes: New Image Generation in ChatGPT

Overview

  • Introduction to a new image generation capability within ChatGPT.
  • The new feature replaces the previous Dolly models (Dolly 1, 2, and 3).
  • Available for paid versions (Plus, Pro, Teams); not yet available for free accounts.

Key Features

  • **Product Mockups: **

    • Ability to create detailed product mockups.
    • Example: Chocolate packaging with precise text and color details.
  • Website Banners:

    • Can create website banners for specific use cases such as resorts trying to attract bookings.
    • Allows revisions, such as size adjustments (e.g., converting to 16:9 format).
  • Realistic Photos with Text:

    • Generates realistic images containing significant amounts of text.
    • Example: A whiteboard with text reflecting the Bay Bridge in San Francisco.
  • Improved Photorealism:

    • Compared to Dolly 3, the new generator produces more realistic images.
    • Examples: A grand library, vibrant butterflies, and close-up portraits.

Practical Use Cases

  • YouTube Thumbnails:

    • Generates thumbnails with techy backgrounds and specific elements (e.g., glowing logos).
    • Faces are not perfectly replicated; still a work-in-progress for exact likeness.
  • Infographics:

    • Good at generating infographics with detailed text, such as the evolution of video games.
  • Memes:

    • Allows for correction of cropped text and provides download options for social media.
  • Style Change:

    • Capable of transforming images, such as turning a person into a cartoon while maintaining other elements.
  • Graphic Markups:

    • Mimics famous publications like Time magazine.
    • Example: A cover with a person surrounded by top AI company logos.

Limitations

  • Face Accuracy:

    • Struggles with replicating exact facial likenesses in images.
  • Image Cropping Issues:

    • Sometimes crops images improperly, affecting the final output.
  • Performance Speed:

    • Slower than previous models and other platforms like ReCraft and MidJourney.
  • Rare Image Flaws:

    • Occasionally fails to follow prompts exactly, such as incorrect lighting scenarios.

Future Updates

  • Plans to update prompt books and create further instructional videos as more experience is gained.
  • Emphasis on continued testing and refinement of the image generation feature.

Conclusion

  • The new image generation in ChatGPT shows significant improvements over Dolly models.
  • While some limitations remain, the feature presents a powerful tool for creative and practical purposes.