Viral GPT-4o Images, Smarter Gemini 2.5, Qwen2.5 Real-Time Chat


In this newsletter:

  • ChatGPT's new GPT-4o image generator is so popular it's "melting GPUs"
  • Google debuts Gemini 2.5 that outsmarts rivals with built-in reasoning
  • Alibaba unveils Qwen2.5 Omni with real-time voice and video chat

Plus, you’ll find new AI tools and this week’s top AI news headlines!

In partnership with Paddle:

Implement Web2App Without the Complexity

The biggest Web2App roadblock? Finding the right payment solution.

Selling on the web means managing:

  • Global payments
  • Tax compliance in 100+ jurisdictions
  • Fraud protection and chargebacks
  • Billing support and refunds

Unless you use Paddle.

As a Merchant of Record, Paddle works like the app stores—but for the web:

✔ Same back-office coverage at lower fees
✔ Handles global payments & tax automatically
✔ Tools to reduce churn and boost retention

HubX prevented $110K in monthly churn with Paddle’s cancellation flows.

👉 Launch Web2App with less stress - start with Paddle

🖼️ ChatGPT's New GPT-4o Image Generator Is So Popular It's "Melting GPUs"

OpenAI has implemented temporary rate limits on its dramatically improved image-generation feature, with CEO Sam Altman revealing the system's popularity is pushing their hardware to the breaking point.

What's new:

  • GPT-4o powers dramatically improved image generation directly within ChatGPT
  • Creates photorealistic images with accurate lighting, textures, and readable text
  • Enables direct editing of existing images, including those with people in them
  • Handles complex scenes with up to 20 distinct objects
  • Excels at structured visuals like menus, diagrams, and infographics
  • Allows natural language or uploaded file input for modifications
  • Rolling out to Free, Plus, Team, and Pro users
  • Internally described as a "step change" above prior image models
  • Temporary rate limits introduced due to overwhelming user demand
  • Free tier access delayed and will soon be limited to three generations per day
  • Company working to increase system efficiency
  • Rate limit expected to be short-term as infrastructure catches up

Why it matters:

The extraordinary demand for ChatGPT's enhanced image capabilities shows how far the quality of AI image generation has advanced.

This latest upgrade has proven so compelling that even OpenAI's substantial computing resources can't keep pace with user enthusiasm, showcasing both the technology's appeal and the computational challenges of advanced AI features.

👀 Read more about ChatGPT's latest image generation breakthrough!

🥇 Google Debuts Gemini 2.5 That Outsmarts Rivals With Built-In Reasoning

Google DeepMind has launched Gemini 2.5, introducing a new class of "thinking models" designed to reason through problems before responding. The first release, Gemini 2.5 Pro Experimental, claims top spots across major AI benchmarks.

What's new:

  • Built-in reasoning capabilities that analyze and refine solutions before answering
  • Tops the LMArena leaderboard by a significant margin
  • Achieves state-of-the-art 18.8% on Humanity's Last Exam without tool use
  • Excels at creating visually compelling web apps and executable code from simple prompts
  • Scores 63.8% on SWE-Bench Verified with a custom agent setup
  • Maintains 1 million token context window (expanding to 2 million soon)
  • Preserves native multimodality for processing text, audio, images, and video
  • Available now in Google AI Studio and for Gemini Advanced users
  • Arriving on Vertex AI in the coming weeks, with pricing details to follow

Why it matters:

Google's shift toward models that can "think" before responding represents a significant advancement in AI problem-solving capabilities.

By building reasoning directly into its models rather than relying on external techniques, Google is positioning itself at the forefront of the race to create more sophisticated AI systems capable of handling increasingly complex tasks with human-like deliberation.

👀 Read how Google's new "thinking" AI model is changing the game!

🌐 Alibaba Unveils Qwen2.5 Omni With Real-Time Voice and Video Chat

Alibaba has released Qwen2.5 Omni, a revolutionary multimodal AI model that processes text, images, audio, and video while simultaneously generating both text and natural speech responses in real time.

What's new:

  • Innovative "Thinker-Talker" architecture for end-to-end multimodal perception
  • Processes multiple input types simultaneously (text, images, audio, video)
  • Generates streaming text and natural speech responses in real time
  • Uses novel TMRoPE position embedding to synchronize video with audio timestamps
  • Outperforms similarly sized single-modality models across benchmarks
  • Excels in speech recognition, translation, and image reasoning
  • Achieves state-of-the-art performance on OmniBench for multimodal tasks
  • Matches specialized models like Qwen2.5-VL-7B and rivals Gemini 1.5 Pro
  • Available now on Qwen Chat, Hugging Face, ModelScope, DashScope, and GitHub

Why it matters:

As AI models evolve, Qwen2.5 Omni represents a significant advancement by unifying multiple capabilities into a single, coherent system.

This integration enables more natural human-AI interactions by eliminating the need to switch between specialized models for different tasks.

The ability to process and respond to diverse inputs simultaneously mirrors human communication, bringing AI one step closer to truly conversational interfaces.

👀 Read how Alibaba is revolutionizing AI communication!

My Latest LinkedIn & X/Twitter Posts:

  • How to run influencer marketing campaigns 10x faster with AI (view post)
  • 10 YouTube channels to learn AI faster and smarter (view post)
  • 5 simple resume tweaks to land more job interviews (view post)
  • How to repurpose existing content across platforms faster with AI (view post)
  • How to turn your PDFs and slides into interactive video tutorials with AI (view post)
  • How one free AI extension helps me save hours every day (view post)

In partnership with Kit:

Monetize Your Audience With My Favorite All-in-One Tool 💡

As a creator, your time should be spent doing what you love — not juggling a dozen tools just to run your business.

Kit gives you everything you need in one place:

✅ Build and grow your email list (I use Kit for my newsletter)
✅ Easily monetize with paid newsletters and digital products
✅ Automate your emails with triggers and custom workflows
✅ Track what’s working and optimize with powerful insights

It’s not just email. It’s your entire creator business, simplified.

Join a thriving community of successful creators.

Use Kit for Free — and Start Building Smarter

New AI Tools to Boost Your Productivity:

  • GPT-4o Image Generation: Generates realistic images from text prompts directly within ChatGPT.
  • Gemini 2.5: Google's latest AI model with enhanced reasoning and accuracy.
  • Effie: Combines note-taking, outlining, and mind mapping with Markdown support.
  • new.email: Offers customizable email templates for different use cases.
  • Jotform Boards: Converts form responses into tasks for collaborative management.
  • Podcastle AI Voices: Turns text to speech using over 1,000 lifelike voices across various accents and styles.
  • Zapier MCP: Lets AI assistants perform tasks across 7,000+ apps without API setup.
  • Supercut: Records screen in up to 4K with auto-chapters and shareable layouts.
  • PocketLink: Turns your profile link into a customizable business tool with tracking and automation.
  • LiftmyCV: Automates job searching and applications across multiple platforms.
  • Flexpoint: Enables creators to sell products, services, and digital content without coding.
  • Eddie AI: Automates video editing tasks like rough cuts and A/B roll logging.
  • NeuraVid: Transcribes, analyzes, and searches video content with smart chapters & speaker detection.
  • MiriCanvas: Provides templates for creating presentations, social media posts, and posters.
  • Cloudairy: Creates animated diagrams, mind maps, and architecture designs.
  • Cobbai: Deploys AI agents for customer support and workflow automation.
  • AudioX: Converts text, images, and videos into professional audio, including music and sound effects.
  • Remio: Captures, organizes, and connects information from notes and web research.
  • WorkFlawless: Manages company workflows, SOPs, and team collaboration in one place.
  • Keyla: Generates realistic user-generated content (UGC) style video ads for social media platforms.

This Week's Top AI News Headlines:

  • Amazon Photos Integrates Shopping Links into User Photo Libraries, Enabling Direct Purchases via Image Recognition (View Article)
  • AI-Generated Video Ads Are Set to Make Personalization More Targeted and Intrusive with Infinite Variations for Each Viewer (View Article)
  • OpenAI’s ChatGPT Sparks Viral Trend of Studio Ghibli-Style AI Images, Prompting Legal and Ethical Questions About Artistic Ownership (View Article)
  • Paris-Based Startup Twin Launches First AI Agent for Qonto to Automatically Retrieve and Upload Invoices from Service Providers, Saving Businesses Hours Each Month (View Article)
  • Google Adds AI-Powered Vacation Planning Tools to Search, Maps, and Gemini with Itinerary Creation, Screenshot Scanning, and Hotel Price Tracking (View Article)
  • Microsoft CEO Satya Nadella Sets DeepSeek’s R1 Model as New AI Benchmark, Urges Teams to Match Its Speed and Efficiency on Azure (View Article)
  • Meta Accused of Training AI Models on Unpublished Books, Raising Concerns Over Author Rights and Data Privacy (View Article)
  • OpenAI Adopts Anthropic’s Model Context Protocol to Improve AI Access to External Data Across ChatGPT and Agents SDK (View Article)
  • xAI Brings Its Grok AI Assistant to Telegram, Expanding Access Beyond X as Part of Broader Integration Strategy (View Article)
  • Krisp Launches Real-Time AI Accent Converter to Translate Indian Accents into American English for Clearer Communication on Zoom and Teams (View Article)
  • Waze Drops Google Assistant Support on iPhone and Plans to Introduce Gemini-Powered Voice Integration Instead (View Article)
  • Microsoft Adds Researcher and Analyst AI Agents to Microsoft 365 Copilot to Automate Research, Data Analysis, and Workflow Tasks for Enterprises (View Article)
  • Character AI Launches Parental Insights to Let Teens Share Weekly Chatbot Usage Reports with Parents Without Revealing Conversation Content (View Article)

Work With Me:

If you enjoy this newsletter, please forward it to your friends and colleagues.

Follow me on LinkedIn and X/Twitter to see my latest posts.

Have a wonderful week!

Andrew Bolis

AI Tips for Business Growth - Newsletter by Andrew Bolis

Learn how to use AI to build, market and grow your business. Subscribe to get AI tips, tools and prompts in your inbox to power your marketing, sales and business.
