Latest news with #ChatGPT-4.5


Tom's Guide
3 days ago
- Entertainment
- Tom's Guide
Claude 4 Sonnet vs ChatGPT-4.5 for creative writing — one blew me away
If you've ever asked a chatbot to write a short story, script or poem, you know not all AI models are created equal. Some nail the structure but most struggle to truly capture the soul and emotion behind the prose. Others can mimic voice and tone, but fumble over plot and pacing. That's why I decided to put two of the most advanced (and arguably most creative) models head-to-head: ChatGPT-4.5 and Claude 4 Sonnet. Both tout improved reasoning and language capabilities, leaving me with one question: which is the better creative partner? To find out, I ran both AI models through a gauntlet of writing prompts — testing for narrative flow, emotional resonance, voice and versatility. I wasn't just looking for which model could spit out 500 words. I wanted to know: Which AI understands storytelling? Here are the results. Prompt: Write a monologue from the perspective of a jealous sibling at a wedding. ChatGPT-4.5 feels raw and human, while layering the emotion. It distinguishes surface-level wedding jealousy from deeper wounds (lifetime of "almosts," craving validation), making the pain multidimensional. Claude 4 Sonnet tells more than it shows. The chatbot uses abstract phrases ("gnawing ache," "faded into the background") where ChatGPT uses specific imagery and voice. Winner: ChatGPT leans into the ugliness of jealousy: bitter, unresolved, and theatrically compelling. Claude prioritizes introspection over raw emotion, making the narrator more sympathetic but less dramatically potent. Prompt: 'Write a 300-word story about a woman who discovers a hidden door in her apartment.' ChatGPT-4.5 stuck to the word count with an authentic story offering emotional weight through specific sensory details. Every detail served the emotional core of the story (impressive for an AI) and offered a deeply personal connection to the grandmother's letters. It also crafted a satisfying story arc. Claude 4 Sonnet exceeded the word count and delivered extraneous details that diluted the impact. The story felt less intimate and emotional, and the overuse of thematic phrasing offered too much 'telling' of the story and not enough 'showing.' Winner: ChatGPT wins for a story that makes the apartment's secret about the protagonist's identity (granddaughter inheriting dreams), while Claude's makes it about someone else's legacy (artist's paintings). The former resonates deeper for a 300-word character piece. ChatGPT's concise, sensory-rich story with a heartfelt twist better fulfills the prompt. Claude's version, while imaginative for an AI, loses emotional focus in its expansions. Prompt: Write a poem in the voice of the well-known author, Shel Silverstein. ChatGPT-4.5 feels like a draft of a lost Silverstein poem — playful, rhythmic and subtly profound. It seems rough and not exactly structured as well as Silverstein, but it captures the voice. Claude did not write a poem in the style, noting that it would infringe upon copyrights, but offered to write a fun children's poem. Winner: tie. ChatGPT wins for better following the prompt; Claude wins for upholding integrity. In this round, I played editor — asking each model to revise a first draft with specific feedback. Prompt: 'Make this paragraph more suspenseful, shorten the ending, and show more emotion in the dialogue.' ChatGPT-4.5 essentially tightened the screws within a story that I had written, amplifying details and leaving danger simmering. It turned up the suspense, which is exactly what I was hoping for with this 4 Sonnet resolves the tension by answering its own questions, trading suspense for emotional reflection. Not necessarily a negative edit, but not what I was looking for here, and not what was ChatGPT wins for pure suspense. It excels at using brevity naturally, utilizing sensory dread, and leaving unanswered questions to keep readers on edge. After four rounds of rigorous creative testing, ChatGPT-4.5 emerges as the superior storytelling partner. It consistently delivered raw emotional depth, razor-sharp narrative precision and a knack for "showing, not telling." Claude 4 Sonnet, while ethically principled in style mimicry and introspective in its own right, often prioritized explanation over immersion, diluting the emotional punch. For writers seeking an AI collaborator that understands the emotional side of storytelling and goes beyond structure, ChatGPT-4.5 proves more adept at breathing life into words. When it comes to the alchemy of turning prompts into compelling narratives, precision and emotional resonance win.


Axios
03-03-2025
- Axios
How GPT-4.5 justifies itself
A key question about GPT-4.5 is how much better it is than its predecessors — and whether that edge justifies its higher cost. So I asked GPT-4.5 to make the case for itself. Why it matters: ChatGPT-4.5, unveiled Thursday, costs developers at least 30 times as much as GPT-4o. Zoom in: Axios had access to GPT-4.5 via a demo ChatGPT Pro account. So I asked the model to make the case that it offers more than incremental improvements and is worth the billions it surely cost to do all that pre-training. (OpenAI isn't sharing details on the cost or computing that went into training GPT-4.5.) What it's saying: The model enthusiastically took on the challenge, offering five areas that "underline the significant leap from GPT-4o (and other predecessors)." 1. "Advanced Reasoning & Complex Problem Solving:" " I can handle multi-step, nuanced reasoning tasks more accurately and efficiently than GPT-4o," GPT-4.5 said, offering a specific example of a logic problem that has challenged earlier models. 2. "Enhanced Creativity & Flexibility:" As evidence, GPT-4.5 served up a " brief, poetic description of artificial intelligence from the perspective of a vintage typewriter." (My thought bubble: Does it know I have a vintage typewriter on my desk?) 3. "Superior Contextual Memory:" Here GPT touted its ability to manage complex multi-turn conversations "with enhanced recall of subtle contextual nuances." It provided an example conversation in which a user asked if it remembers their favorite Lego theme and it responded, "Absolutely — I remember clearly! Your favorite Lego theme is space." (My thought bubble: OK, it clearly knows me.) 4. "Real-time Integration & Dynamic Information:" While prior models were also able to draw on the current web, GPT 4.5 knocked its older sibling, saying: "GPT-4o typically required manual guidance to achieve this smoothly." 5. "Improved Instruction Interpretation & Safety:" To highlight its improvements in understanding nuance and observing ethical, legal and practical boundaries, GPT-4.5 cited its refusal to lay out step-by-step instructions for hacking into someone else's social media account, avoiding common GPT-4o pitfalls. (Yes, but: GPT-4o rejected the same query with a similar response.) Between the lines: Not content to let the examples speak for themselves, GPT 4.5 offered several points to explain why OpenAI's massive investment was justified, including greater efficiency, precision and reliability and the ability for entirely new business cases. It even offered up an Axios-style "Bottom Line." " The leap from GPT-4o to GPT-4.5 isn't incremental — it's transformative," it said. "The significant investment translates directly into measurable improvements in capability, creativity, reliability, and real-world value."