Lecture Notes: New Image Generation in ChatGPT

Overview

Introduction to a new image generation capability within ChatGPT.
The new feature replaces the previous DALL-E models (DALL-E 1, 2, and 3).
Available on paid plans (Plus, Pro, Team); not yet available on free accounts.

Product Mockups:
- Ability to create detailed product mockups.
- Example: Chocolate packaging with precise text and color details.
Website Banners:
- Can create website banners for specific use cases such as resorts trying to attract bookings.
- Allows revisions, such as size adjustments (e.g., converting to 16:9 format).
Realistic Photos with Text:
- Generates realistic images containing significant amounts of text.
- Example: A whiteboard covered in text, with a reflection of the Bay Bridge in San Francisco.
Improved Photorealism:
- Compared to DALL-E 3, the new generator produces more realistic images.
- Examples: A grand library, vibrant butterflies, and close-up portraits.
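The 16:9 revision mentioned under Website Banners comes down to simple aspect-ratio arithmetic. The notes do not describe how ChatGPT performs the resize internally, so the sketch below only shows how to work out the largest 16:9 crop yourself if you want to check or trim a generated banner. The function names and the 1024x1024 starting size are illustrative assumptions, not part of the feature.

```python
from fractions import Fraction


def largest_crop(width: int, height: int, ratio_w: int = 16, ratio_h: int = 9) -> tuple[int, int]:
    """Return the largest (crop_w, crop_h) with the target ratio that fits inside the image."""
    target = Fraction(ratio_w, ratio_h)
    if Fraction(width, height) > target:
        # Image is wider than the target ratio: keep full height, trim the width.
        crop_h = height
        crop_w = int(height * target)
    else:
        # Image is taller than (or equal to) the target ratio: keep full width, trim the height.
        crop_w = width
        crop_h = int(width / target)
    return crop_w, crop_h


def centered_box(width: int, height: int, crop_w: int, crop_h: int) -> tuple[int, int, int, int]:
    """Return (left, top, right, bottom) of a crop centered in the image."""
    left = (width - crop_w) // 2
    top = (height - crop_h) // 2
    return left, top, left + crop_w, top + crop_h


# Hypothetical square source image; the size is an assumption for illustration.
w, h = 1024, 1024
crop_w, crop_h = largest_crop(w, h)
print(crop_w, crop_h)                       # 1024 576
print(centered_box(w, h, crop_w, crop_h))   # (0, 224, 1024, 800)
```

A centered crop discards content at the edges, which is one reason to ask the generator for a 16:9 revision instead of cropping a square image after the fact.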

YouTube Thumbnails:
- Generates thumbnails with techy backgrounds and specific elements (e.g., glowing logos).
- Faces are not perfectly replicated; still a work-in-progress for exact likeness.
Infographics:
- Good at generating infographics with detailed text, such as the evolution of video games.
Memes:
- Generates memes; cropped text can be corrected in a follow-up request, and results can be downloaded for social media.
Style Change:
- Capable of transforming images, such as turning a person into a cartoon while maintaining other elements.
Graphic Mockups:
- Mimics the cover style of famous publications such as Time magazine.
- Example: A cover with a person surrounded by top AI company logos.

Face Accuracy:
- Struggles with replicating exact facial likenesses in images.
Image Cropping Issues:
- Sometimes crops images improperly, affecting the final output.
Performance Speed:
- Slower than previous models and other platforms like Recraft and Midjourney.
Rare Image Flaws:
- Occasionally fails to follow the prompt exactly, for example by rendering the wrong lighting.

Plans to update prompt books and create further instructional videos as more experience is gained.
Emphasis on continued testing and refinement of the image generation feature.

The new image generation in ChatGPT shows significant improvements over the DALL-E models.
While some limitations remain, the feature presents a powerful tool for creative and practical purposes.