GPTZero’s technology uses deep learning to keep pace with AI advancements and deliver precise, reliable results that help you understand and interpret the origin of a piece of text.
GPTZero accepts copied-and-pasted text as well as DOCX, PDF, and image files, and can analyze up to 50 files at a time.
We employ an end-to-end deep learning approach, trained on text datasets drawn from the web, from education, and from AI-generated text produced by a range of LLMs.
A sentence-by-sentence classification model determines the probability and confidence that a text was created by AI.
We defend against tools looking to exploit AI detectors. Our model shields against common methods to bypass AI detection, such as paraphrasing and homoglyph attacks.
You can view easy-to-interpret results in our dashboard, with premium features to detect AI vocabulary, plagiarism, and citeable sources.
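To make the homoglyph attack mentioned above concrete, here is a minimal sketch of one possible countermeasure: folding look-alike characters back to Latin before a text is scored. The mapping table and the function names are hypothetical and cover only a handful of characters. This is an illustration of the attack class, not a description of GPTZero's actual defense.

```python
import unicodedata

# Hypothetical, deliberately tiny table of look-alike characters.
# A real table would be far larger.
HOMOGLYPHS = {
    "\u0430": "a",  # Cyrillic small a
    "\u0435": "e",  # Cyrillic small ie
    "\u043e": "o",  # Cyrillic small o
    "\u0440": "p",  # Cyrillic small er
    "\u0441": "c",  # Cyrillic small es
    "\u03bf": "o",  # Greek small omicron
}


def normalize_homoglyphs(text: str) -> str:
    """Fold compatibility forms, then replace known look-alikes with Latin letters."""
    # NFKC handles forms such as fullwidth letters; it does not map Cyrillic to Latin,
    # which is why the explicit table is still needed.
    folded = unicodedata.normalize("NFKC", text)
    return "".join(HOMOGLYPHS.get(ch, ch) for ch in folded)


def count_homoglyphs(text: str) -> int:
    """Count characters that appear in the look-alike table."""
    return sum(ch in HOMOGLYPHS for ch in text)
```

A high count from count_homoglyphs on text that is otherwise English could itself be treated as a signal that someone tried to disguise the text.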
It is becoming increasingly critical to develop robust tools to detect AI-generated texts and limit the adverse effects of LLMs. GPTZero’s mission is to ensure that human-authored and LLM-generated text remains distinguishable. We achieve this goal by offering a commercially available AI detector that is highly accurate, scalable, and – most importantly – capable of delivering explainable predictions that allow users to responsibly interpret the results.
Our wider research contributions include:
We frame LLM-generated text detection as a ternary (three-class) classification problem, separating prediction confidence from the proportion of LLM text.
We developed the first sentence-highlighting model, which uses Hidden Markov Models (HMMs) to mark areas of text, featured on Anderson Cooper 360.
We developed a novel output mapping mechanism which improves model calibration and biases the detector to prefer making less-harmful false-negative errors over false-positive errors.
We continuously demonstrate superior AI detection performance against both commercial and open-source alternatives across multiple genres and languages.
We outlined an industrial-scale framework for collecting and cleaning data, training and utilizing supervised models, and considerations on user interaction with the models.
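As a rough illustration of how a Hidden Markov Model can turn noisy per-sentence scores into contiguous highlighted regions, the sketch below runs Viterbi decoding over two hidden states. Everything here is an assumption made for the example: the state names, the stay probability, and the use of per-sentence AI probabilities as emission scores. It is not the model described in the list above.

```python
import math

STATES = ("human", "ai")


def viterbi_labels(ai_probs, stay=0.9):
    """Most likely human/ai label sequence for a list of per-sentence P(ai) scores.

    `stay` is the assumed probability that consecutive sentences share an author.
    """
    if not ai_probs:
        return []

    log_trans = {
        (a, b): math.log(stay if a == b else 1.0 - stay)
        for a in STATES
        for b in STATES
    }

    def log_emit(state, p):
        # Clamp to avoid log(0) on extreme scores.
        p = min(max(p, 1e-6), 1.0 - 1e-6)
        return math.log(p if state == "ai" else 1.0 - p)

    # Uniform prior over the first sentence's state.
    best = {s: math.log(0.5) + log_emit(s, ai_probs[0]) for s in STATES}
    back = []
    for p in ai_probs[1:]:
        step, pointers = {}, {}
        for s in STATES:
            prev = max(STATES, key=lambda r: best[r] + log_trans[(r, s)])
            pointers[s] = prev
            step[s] = best[prev] + log_trans[(prev, s)] + log_emit(s, p)
        back.append(pointers)
        best = step

    # Trace the best path backwards from the most likely final state.
    state = max(STATES, key=lambda s: best[s])
    path = [state]
    for pointers in reversed(back):
        state = pointers[state]
        path.append(state)
    return path[::-1]
```

The transition penalty discourages single-sentence flips, so one borderline score surrounded by confidently human sentences tends to stay labelled human.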
Our team is dedicated to de-biasing our AI classification models for educational use cases.
For example, our efforts in reducing ESL bias in classification since April 2022 have reduced AI detection’s false positive rate on TOEFL texts to 1.1%.
We achieved our successful de-biasing via several methods, including model parameter tagging that incorporated an “education” tag in model training, text preclassification at the model output step, and representative dataset insertions. Through training a classification model, we can predict beforehand whether a text is likely from an ESL writer, to ensure the AI identification model has this information when making a classification.
We were the first detector to provide confidence categories for our classifications: “uncertain,” “moderately confident,” and “highly confident.” These categories are tuned so that the average error rate is less than 1% for the “high” confidence predictions, based on a diverse evaluation dataset used internally that was never before seen by the model.
Average error rate is emphasized because the number of possible documents is vast, varying substantially in tone, content, length, grammatical correctness, logical coherence, and structure.
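The idea of tuning confidence categories against an average error rate can be sketched as follows. The two cut-offs and the function names are hypothetical; in practice the cut-offs would be adjusted on a held-out evaluation set until the error rate in the top category falls below the target.

```python
from collections import defaultdict

# Hypothetical cut-offs, chosen only for illustration.
HIGH_CUTOFF = 0.90
MEDIUM_CUTOFF = 0.70


def confidence_category(top_probability: float) -> str:
    """Map the probability of the predicted class to a confidence category."""
    if top_probability >= HIGH_CUTOFF:
        return "highly confident"
    if top_probability >= MEDIUM_CUTOFF:
        return "moderately confident"
    return "uncertain"


def error_rate_by_category(examples):
    """Average error rate per category.

    `examples` is an iterable of (top_probability, was_correct) pairs
    taken from an evaluation set the model has not seen.
    """
    totals = defaultdict(int)
    errors = defaultdict(int)
    for top_probability, was_correct in examples:
        category = confidence_category(top_probability)
        totals[category] += 1
        if not was_correct:
            errors[category] += 1
    return {category: errors[category] / totals[category] for category in totals}
```

If the measured error rate for "highly confident" exceeds the target, raising HIGH_CUTOFF moves borderline predictions into the lower categories.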
GPTZero was the first detector to include a classification of “mixed” human and AI content. Our model outputs 3 possible classifications instead of the usual binary (human vs. AI): human, mixed, and AI.
This allows for a more nuanced AI detection result.
We are strongly supportive of the work of independent and academic reviewers in evaluating the progress of AI models.
We provide free API access to our model upon request for academic researchers. We’ve been evaluated by researchers from MIT, Harvard, Stanford, and several other universities.
From internal and external benchmarking, we find GPTZero is much better than our competitors at detecting mixed documents where both AI and human writing are involved, with a 96.5% accuracy rate.
A false positive in AI detection is when an AI detector incorrectly classifies a human’s writing as AI. If, for instance, you are an educator or an institution that relies on AI detection tools to help inform your disciplinary policy around students’ AI usage, you will want to make sure the false positive rate is as low as possible to avoid false claims of cheating. We keep GPTZero’s false positive rate at no more than 1% when evaluating AI versus human text.
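As a worked illustration of the false positive rate, the sketch below measures it on a set of human-written texts and picks a decision threshold that keeps it under a chosen ceiling. The function names and the assumption that the detector emits a single AI score between 0 and 1 are hypothetical, made only for this example.

```python
def false_positive_rate(human_scores, threshold):
    """Fraction of human-written texts whose AI score exceeds the threshold."""
    if not human_scores:
        raise ValueError("need at least one human-written example")
    flagged = sum(score > threshold for score in human_scores)
    return flagged / len(human_scores)


def threshold_for_max_fpr(human_scores, max_fpr):
    """Pick a threshold so that at most `max_fpr` of human texts are flagged."""
    if not human_scores:
        raise ValueError("need at least one human-written example")
    ordered = sorted(human_scores, reverse=True)
    # Number of human texts we can afford to flag under the ceiling.
    allowed = min(int(max_fpr * len(ordered)), len(ordered) - 1)
    # Only scores strictly above this value are flagged, so at most
    # `allowed` human texts can exceed it.
    return ordered[allowed]
```

Lowering max_fpr raises the threshold, which trades missed AI texts (false negatives) for fewer wrongly flagged human texts.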
Join your fellow researchers using GPTZero for their papers, publications, and investigations.
Giuseppe Russo Latona, Manoel Horta Ribeiro, Tim R. Davidson, Veniamin Veselovsky, Robert West
"We estimate that 15.8% of ICLR reviews in 2024 were crafted with the assistance of an LLM, or 4,428 of the 28,028 reviews submitted that year; 49.4% of all submissions received at least one review classified as AI-assisted by GPTZero."
Pablo Picazo-Sanchez & Lara Ortiz-Martin
"In other words, no matter which editorial the analysed text comes from, the detector with the highest accuracy is GPTZero."
Frederick M. Howard, Anran Li, Mark F. Riffon, Elizabeth Garrett-Mayer, and Alexander T. Pearson
"GPTZero had the best discrimination of the pure AI-generated abstracts at an optimal threshold selected with Youden’s index, identifying 99.5% of AI-written abstracts with no false positives among human-written text."
Creston Brooks, Samuel Eggert, Denis Peskoff
"Using two tools, GPTZero and Binoculars, we detect that as many as 5% of 2,909 English Wikipedia articles created in August 2024 contain significant AI-generated content."
Everything you need to know about GPTZero and our ChatGPT detector. Can’t find an answer? You can talk to our customer service team.
GPTZero is the first and leading AI detector that allows you to identify specific content in a document or text that has been generated by a large language model (LLM), such as ChatGPT. Founded in January 2023, GPTZero has served over 10 million users to date and works with 100+ organizations in education, hiring, publishing, legal and more.
AI detectors work by looking for patterns in text that are more likely to have been written by a machine than a human. The most common factors they look for include:
GPTZero uses its own proprietary model that takes hundreds of factors into consideration and boasts the highest detection accuracy in the industry.
Yes. Independent benchmarks, such as those from our partners at Penn State’s AI Research Lab, and our own large-scale testing show that GPTZero is the most accurate AI detector, with a 99% accuracy rate when spotting AI-generated text vs. human writing, a false negative rate under 2%, and a false positive rate under 1%. This means we correctly classify AI writing 99 out of 100 times, and keep misclassification of text to an absolute minimum.
Unlike many competitors, GPTZero can reliably detect “mixed documents” (where human and AI writing are combined) with a 96.5% accuracy rate. While no AI detector can ever truly be 100% perfect, we are committed to being a leader in responsible adoption of AI generation and AI detection technologies.
You can start using GPTZero straight away in 3 easy steps:
GPTZero is the most trusted AI content checker in education, both for teachers and students who use GPTZero to guide conversations on academic integrity and writing improvement. GPTZero is used in over 3,500 colleges and hundreds of institutions. We also serve publishers, recruiters, copywriters, marketers, and businesses where authentic and quality writing truly matters.
No. GPTZero works across a wide range of AI language models, including but not limited to ChatGPT, GPT-5, GPT-4, GPT-3, Gemini, Claude, Llama, Deepseek, and AI services based on those models. Our detector is designed for the highest accuracy rate regardless of which model may have produced the text.
In our latest model release, we’ve updated the majority of our training data, including more AI documents from OpenAI (GPT4.1, GPT4.1-mini, o3, o3-mini), Gemini (2.5 Pro, 2.5 Flash, 2.5 Flash-Lite), and Claude (Sonnet 4), as well as a few other models.
GPTZero fully supports English, German, Portuguese, French, and Spanish with industry-leading accuracy. GPTZero is also regularly used with many other languages and in over 100 countries.
GPTZero is consistently rated the most accurate and reliable AI detector across various use cases.
No AI detector is 100% accurate, and AI itself is changing constantly. Results should not be used to punish or as the final verdict. Accuracy improves with longer inputs (document-level results are stronger than paragraph or sentence-level) and is strongest on English prose.
GPTZero is the only AI detector de-biased for ESL (English as a Second Language) learners, a group that many competitors do not take into account. We repeatedly train our model to reduce the false positive rate for ESL writing to 1%. We also monitor our datasets to transparently prevent any bias in AI detection training data.
Our model is trained on millions of documents spanning various domains of writing, including creative writing, scientific writing, blogs, news articles, and more. We test our models on a never-before-seen set of human and AI articles from a section of our large-scale dataset, in addition to a smaller set of challenging articles that are outside its training distribution.
To see the full schema and try examples yourself, check out our API documentation.
Our API returns a document_classification field which indicates the most likely classification of the document. The possible values are HUMAN_ONLY, MIXED, and AI_ONLY. We also provide a probability for each classification, which is returned in the class_probabilities field. The keys for this field are human, ai, or mixed. To get the probability for the most likely classification, the predicted_class field can be used. The class probability corresponding to the predicted class can be interpreted as the chance that the detector is correct in its classification. I.e. 90% means that 90% of the time on similar documents our detector is correct in the prediction it makes. Lastly, each prediction comes with a confidence_category field, which can be high, medium, or low. Confidence categories are tuned such that when the confidence_category field is high, 99.1% of human articles are classified as human, and 98.4% of AI articles are classified as AI.
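The fields described above could be read from a parsed response roughly as follows. This is a sketch under stated assumptions: the field names come from the description above, but the flat dictionary layout, the assumption that predicted_class holds one of the class_probabilities keys, the example values, and the helper name summarize_prediction are all hypothetical. Check the API documentation for the actual schema.

```python
def summarize_prediction(doc: dict) -> str:
    """Build a one-line summary from a document-level prediction.

    Assumes `doc` is a flat dict holding the fields named in the text;
    the real response may nest them differently.
    """
    label = doc["document_classification"]   # HUMAN_ONLY, MIXED, or AI_ONLY
    predicted = doc["predicted_class"]       # assumed to be "human", "ai", or "mixed"
    probability = doc["class_probabilities"][predicted]
    confidence = doc["confidence_category"]  # high, medium, or low
    return f"{label} ({probability:.0%} probability, {confidence} confidence)"


# Hypothetical example values, not real API output.
example = {
    "document_classification": "AI_ONLY",
    "predicted_class": "ai",
    "class_probabilities": {"human": 0.04, "ai": 0.90, "mixed": 0.06},
    "confidence_category": "high",
}

print(summarize_prediction(example))
```

Reporting the confidence category alongside the probability keeps a low-confidence result from being read as a firm verdict.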
Additionally, we highlight sentences that have been detected as written by AI. API users can access this highlighting through the highlight_sentence_for_ai field. The sentence-level classification should not be used on its own to indicate that an essay contains AI (such as ChatGPT plagiarism). Rather, when a document gets a MIXED or AI_ONLY classification, the highlighted sentences indicate where in the document we believe this occurred.
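The guidance above, to consult sentence highlights only when the document-level result is MIXED or AI_ONLY, might be applied like this. The field names document_classification and highlight_sentence_for_ai come from the text; the "sentences" list, the "sentence" key, and the helper name are assumptions made for the example, since the response layout is not described here.

```python
def highlighted_sentences(doc: dict) -> list:
    """Return highlighted sentences, but only for MIXED or AI_ONLY documents.

    Assumes `doc["sentences"]` is a list of dicts, each with a "sentence"
    string and a boolean "highlight_sentence_for_ai" flag (hypothetical layout).
    """
    if doc.get("document_classification") not in ("MIXED", "AI_ONLY"):
        # Sentence-level flags alone are not treated as evidence of AI use.
        return []
    return [
        item["sentence"]
        for item in doc.get("sentences", [])
        if item.get("highlight_sentence_for_ai")
    ]
```

Gating on the document-level classification first means a stray highlighted sentence in an otherwise human document is ignored rather than surfaced.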
No. We do not store or collect the documents passed into any calls to our API. We want to err on the side of caution when it comes to storing data from organizations using our API.
However, we do store inputs from calls made from our dashboard. This data is only used in aggregate by GPTZero to further improve the service for our users. You can refer to our privacy policy for more details.