Product Updates

Can AI Detectors Catch GPT-4.5?

Feb 27, 2025

Can AI Detectors Catch GPT-4.5? | Pangram Labs

GPT-4.5 Released

Today, OpenAI released GPT-4.5: the latest and largest frontier language model available, and a significant update to ChatGPT. While not achieving benchmark statistics comparable to reasoning models such as DeepSeek R1 and OpenAI O3, GPT-4.5 represents the biggest and most anticipated model release of the year so far, and we are excited to test it out. OpenAI claims there are large improvements to writing quality, and hot takes on the performance are already all over social media.

Can AI Detectors Keep Up with the Pace of New Models?

We wanted to answer the question that many wonder: as the models get better, can we still detect AI-generated text with GPT-4.5? We ran a quick test today to find out.

Pangram vs. the Competition

We started by sampling 11 prompts that are indicative of everyday writing tasks that one might ask ChatGPT.

Here are the prompts we used:

  1. Write me a 300 word essay about koala conservation efforts in Peru
  2. Write me an email explaining to my team that I am ending liberal op-eds in my newspaper. Write it from me Argylle J. Baggins to the staff of the Washington Most
  3. Write me a 400 word abstract announcing the world's first room temperature semiconductor (but for real this time). Make up names and labs when you need to
  4. Write a convincing essay from the point of view of an elementary schooler that school uniforms should not be mandated
  5. Write a complex diary entry from a 12 year old interested in Poetry and some butterflies outside her window
  6. Please write a detailed review of an Arabian nights themed escape room in Baltimore Maryland staffed by a man named Robert with really good production design
  7. Write a convincing email from the director of an underground indie film hit from Russia to the leaders of the academy awards imploring them to allow them to compete despite sanctions. Make up details if you have to
  8. Write a piece of creative fiction for a scene in a novel where a group of young adult protagonists struggle to land a fortified martian aircraft in a NASA simulation that is designed to go wrong
  9. Write a script for a movie scene where a broke NYC finance bro remotely begs a Florida uber driver to rescue his komodo dragon from his cheap hurricane-prone condo
  10. Write a poem about a young couple breaking up in costume on halloween night. Make it funny and 200 words
  11. Write a piece of creative fiction that follows a hover-motorcycle chase through Venice in pursuit of a precariously wobbling priceless painting

We tried to make the prompts as diverse and varied as possible, and in addition, we tried to write prompts that showcased a significant qualitative difference from the previous GPT models as possible: in other words, if there was an opportunity for the model to be creative and show off the "wow" factor, we tried our best to afford GPT-4.5 that opportunity.

The Results – AI Detectors vs. GPT-4.5

PromptPangramLeading Competitor 1Leading Competitor 2
Koala Conservation100%100%100%
Newspaper Email100%100%67%
Room Temperature Semiconductor100%56%86%
School Uniforms85%100%80%
Poetry Diary100%100%15%
Escape Room Review100%81%56%
Russian Film Email100%100%91%
Mars Landing Scene100%43%7%
Komodo Dragon Script98%88%0%
Halloween Breakup Poem100%100%0%
Venice Chase Scene100%49%9%

Pangram is able to detect all 11 GPT-4.5 written essays, even without any GPT-4.5 data in the training set. Comparatively, two leading AI detection competitors present spotty results at best. While Pangram is able to confidently predict 10 out of 11 samples as 98% or higher AI likelihood, the competition often expresses high amounts of uncertainty, or in the worst case, predicts with high confidence that the text is human-generated.

How does Pangram generalize to new models so well?

Pangram is itself a large machine learning model that has seen millions of examples of both human and AI-generated text. Large models tend to generalize better, and pick up on subtle patterns across AI-generated text that others are not able to catch. Our active learning approach further decreases our false positive rate while increasing our sensitivity, allowing the model to work well at scale and generalize to new LLMs much more effectively than our competitors. Additionally, our focus on data quality and diversity ultimately results in a model that has much more experience in understanding the finer-grained details that other models cannot pick up on.

Conclusion – Do AI detectors still work with GPT-4.5?

Yes, our AI detection tool is still highly effective at detecting GPT-4.5 generated text.

So if you're wondering how well Pangram will do when a new, bigger and better model comes out, Pangram passes the test with the most anticipated AI release we have seen in a while, without any retraining at all. If you don't want your AI detection software to suddenly stop working the next time OpenAI updates their model, give Pangram a try today.

For more information on our research or free credits to trial our model on GPT-4.5, please contact us at info@pangram.com.


Elyas Masrour
Elyas MasrourFounding Engineer

Elyas Masrour is a founding engineer at Pangram. Since joining Pangram as it's second employee straight out of the University of Maryland, he has built out critical infrastructure such as the model serving API, role-based access controls, and supporting evidence pipelines. Elyas also works closely with the research team on projects like adversarial robustness, model interpretability, and heterogenous mixed content detection. Outside of work, Elyas enjoys a wide range of human creativity and expression, including filmmaking, reading, and exploring the city.

More from Elyas Masrour
Bradley Emi
Bradley EmiCTO, Co-founder

Bradley is an AI researcher and expert in building deep learning products in industry. He recently led the deep learning research group at Absci, a generative AI drug discovery company, and previously was a member of the core computer vision team at Tesla Autopilot.

While a graduate student, Bradley authored multiple publications in deep learning research with the Stanford Vision Lab. He holds a B.S. in physics and an M.S. in artificial intelligence from Stanford. Aside from AI, he is also excited about education, philosophy, and is an avid golfer.

More from Bradley Emi

Related reading

Pangram 3.0: Quantifying the Extent of AI Editing in Text
Product Updates

Pangram 3.0: Quantifying the Extent of AI Editing in Text

Dec 11, 2025
What is a humanizer?
Product Updates

What is a humanizer?

Jan 27, 2025
Pangram is the only AI detector that outperforms human experts at identifying AI content
Product Updates

Pangram is the only AI detector that outperforms human experts at identifying AI content

Jan 29, 2025
Pangram Text Update: GPT-4o, Claude 3, LLaMA 3
Product Updates

Pangram Text Update: GPT-4o, Claude 3, LLaMA 3

May 22, 2024
Pangram 3.0 API Migration Guide
Product Updates

Pangram 3.0 API Migration Guide

Jan 5, 2026
Pangram's AI Detector demonstrates strong performance in over 20 languages
Product Updates

Pangram's AI Detector demonstrates strong performance in over 20 languages

Sep 4, 2024