Failing Grades Rise as Students Turn to ChatGPT for Assignments

Berkeley computer science courses recorded a noticeable uptick in failing grades over recent semesters, coinciding with the widespread adoption of ChatGPT and similar AI tools among undergraduates. Instructors observed that students who leaned heavily on AI for problem sets struggled to reproduce solutions during proctored exams without digital assistance. The pattern raises uncomfortable questions about how universities measure actual learning when AI can generate passable work on demand.

TL;DR: AI tools like ChatGPT are linked to rising failing grades and declining math skills at universities including Berkeley. Students rely on AI for essays and problem sets, but struggle to demonstrate knowledge without it. According to a 2024 survey cited by Academia PAN, over 80% of students at some institutions admit to using generative AI for coursework. The core issue is outdated assessment, not the technology itself.

Are Failing Grades Increasing Because Students Use AI Tools?

Yes, there is growing evidence that AI tool usage correlates with higher rates of failing grades, particularly in STEM courses at institutions like UC Berkeley. The problem is not that AI makes students less intelligent — it is that AI creates a false sense of competence that collapses during traditional exams. Students submit polished homework generated with AI assistance, then freeze when asked to solve similar problems on paper under time pressure.

A report discussed by Academia PAN highlights that essays, reports, and projects have been foundational assessment tools for years. Generative AI makes it increasingly difficult to determine what a student actually produced versus what a machine generated. When homework grades inflate but exam performance drops, the result is a widening gap between perceived and actual knowledge.

The data from Berkeley CS courses is telling. Instructors noticed that sections where AI usage was most prevalent during homework assignments saw the sharpest declines in midterm and final exam scores. Students who used AI to complete problem sets often could not trace the logical steps behind their own submitted solutions. This is not subtle.

The mechanism is straightforward. AI tools provide correct answers without requiring the intermediate reasoning that builds durable understanding. When a student copies an AI-generated proof without working through each step, they miss the struggle that creates neural pathways for retention. The homework looks perfect. The exam tells a different story.

Antyweb reports that Polish universities face a similar pattern. Students admit to using AI for assignments regularly, yet professors note that classroom discussion and impromptu questioning reveal significant gaps in foundational knowledge. The grades on submitted work do not match observable competence.

Pulshr.pl coverage reinforces this trend, noting that young people increasingly cannot imagine studying without AI. The article points out that while AI helps with learning, data analysis, and project organization, it also creates dependency. When the tool becomes unavailable — during an exam, a job interview, or a live coding challenge — the cracks become visible. Failing grades are the symptom, not the disease.

How Is AI Changing the Way Students Learn Mathematics?

AI is fundamentally altering mathematical skill development by allowing students to bypass the manual computation and logical reasoning that build core competencies. At Berkeley and other institutions, computer science students increasingly arrive at correct final answers for math-heavy problem sets using tools like ChatGPT, but cannot explain the derivation steps or apply the same techniques to slightly modified problems during exams.

Mathematics learning depends on repetition, struggle, and gradual pattern recognition. When AI provides the solution immediately, students skip the productive friction that creates lasting understanding. The result is a generation of learners who can recognize correct answers but cannot generate them independently. This creates a fragile form of knowledge.

The Antyweb article on Polish universities describes a parallel phenomenon. Students use AI to solve mathematical problems in physics, engineering, and economics courses. When asked to demonstrate their work orally, many cannot reconstruct the reasoning. Professors report that students sometimes do not even recognize the mathematical notation in their own submitted solutions.

Academia PAN’s analysis points to a deeper issue. The problem is not AI itself but the fact that mathematical education still relies on take-home problem sets as a primary assessment method. When students can input a problem into ChatGPT and receive a complete solution with steps, the homework ceases to function as a learning tool. It becomes a transaction.

The decline in mathematical skills manifests in specific ways. Students struggle with mental arithmetic that previous generations handled routinely. They have difficulty estimating whether an AI-generated answer is in the correct range. They lack the number sense that comes from years of manual calculation. When the AI makes an error — which happens regularly in mathematical reasoning — students cannot catch it.

Pulshr.pl notes that some educators are adapting by requiring students to solve problems live during class or using oral examinations. These methods force students to demonstrate real-time mathematical thinking without AI mediation. Early results suggest that students who practice without AI assistance perform significantly better on assessments that test genuine understanding.

The mathematical skill decline is not limited to any single country. The Vietnamese academic integrity article discusses how AI-generated mathematical solutions have become indistinguishable from student work, making it nearly impossible for instructors to identify who actually understands the material. The old model is broken.

Why Do Traditional Assessment Methods Fail in the AI Era?

Traditional assessment methods fail because they were designed for an era when producing text or solving problem sets required human effort and demonstrated human understanding. Essays, take-home exams, and homework assignments all assume that the act of production correlates with comprehension. Generative AI breaks this assumption completely.

Academia PAN explicitly states that the problem is not AI but the way students are assessed. Essays, reports, presentations, and projects have served as basic assessment tools at universities for years. Generative AI makes it increasingly difficult to establish what a student actually completed versus what was machine-generated. The entire evaluation framework needs rethinking.

Consider the standard university workflow. A professor assigns an essay, students submit it, a grader evaluates it. This system worked when writing an essay required reading sources, synthesizing information, and composing original text. ChatGPT can produce a B-grade essay on most topics in under thirty seconds. The essay no longer measures what it was designed to measure.

The same applies to problem sets in STEM courses. When a student can photograph a problem or type it into an AI tool and receive a complete solution, the homework stops being evidence of learning. Grades on these assignments inflate while actual competence stagnates or declines. This grade inflation masks the real problem until exam day.

Antyweb’s analysis of Polish universities echoes this assessment. The article argues that universities need to stop pretending that AI is just a shortcut and acknowledge that it has fundamentally changed what take-home assignments measure. When a tool can generate passable work instantly, the assignment must change or lose all evaluative meaning.

The academic integrity article from Vietnam describes how plagiarism in the AI era is no longer the easily recognizable copy-paste phenomenon of previous decades. AI-generated text is unique each time, making detection far more difficult. Traditional plagiarism detection tools like Turnitin were built to find matching text, not to identify machine-generated original content. The old tools cannot solve the new problem.

Some institutions are experimenting with alternative assessments: oral exams, in-class writing under supervision, collaborative problem-solving observed by instructors, and project defenses where students must explain their work in real time. These methods are more resource-intensive but produce more valid measures of student learning.

What Happens When Students Can’t Distinguish AI Output From Their Own Work?

When students cannot distinguish AI-generated output from their own thinking, they experience what researchers describe as a competence illusion — a genuine belief that they understand material they actually cannot reproduce or apply. This phenomenon is particularly dangerous in fields like computer science and mathematics where foundational skills build on each other sequentially.

The Berkeley data illustrates this clearly. Students who submitted AI-assisted homework throughout the semester often expressed genuine surprise when they failed midterms. They had seen correct solutions associated with their name repeatedly. They had received positive feedback on assignments. The gap between their perceived and actual ability came as a shock. The system lied to them.

Academia PAN’s coverage suggests that this confusion extends beyond individual students to institutional level. When universities cannot distinguish between AI-generated and student-generated work, their grading systems produce inaccurate data about cohort learning outcomes. Departments make curriculum decisions based on inflated grades that do not reflect real student capabilities.

The Vietnamese academic integrity article describes plagiarism in the AI era as fundamentally different from traditional academic dishonesty. Students are not simply copying anymore. They are collaborating with an AI that produces unique, contextually appropriate text. The student may edit this output, add their own ideas, or submit it largely unchanged. Over time, the boundary between human and machine contribution blurs even for the student themselves.

Pulshr.pl reports that young people increasingly view AI as an extension of their own cognitive abilities rather than a separate tool. When asked whether they wrote something themselves, many students answer honestly that they are unsure. They prompted the AI, selected from its outputs, and refined the results. Is that writing? Is that learning? The definition has become genuinely ambiguous.

This ambiguity creates problems for academic integrity policies built on clear distinctions between original work and copying. When a student uses AI to generate a first draft, then rewrites portions, adds examples, and restructures arguments, the final product exists in a gray zone. Most university honor codes were not written to address this scenario. The rules no longer fit reality.

The consequences extend beyond grades. Students who graduate with AI-dependent skills enter a workforce where AI is available but where human judgment, critical evaluation, and independent problem-solving remain essential. Employers report that new hires who relied heavily on AI during university struggle when asked to work without it. The competence illusion does not survive first contact with real professional demands.

How Are Universities Responding to the AI Challenge in Classrooms?

Universities are caught between banning AI and integrating it, with most institutions still lacking coherent policies. According to Academia PAN, the real problem is not AI itself but the way students are assessed — traditional essays, reports, and presentations no longer reliably measure learning outcomes. Over 60% of Polish universities have not updated their assessment frameworks since ChatGPT’s release, leaving faculty to improvise individual approaches.

Some departments are experimenting with oral exams, in-class writing, and collaborative projects that resist easy automation. Others are investing in AI detection tools, though these remain unreliable and prone to false positives. The Academia PAN report argues that institutions must redesign evaluation from the ground up rather than playing technological whack-a-mole.

A handful of progressive universities have introduced “AI literacy” modules into first-year curricula. These courses teach students when AI use is appropriate and how to critically evaluate generated content. Still, adoption remains fragmented. Most faculty receive no training on AI pedagogy, and departmental policies vary wildly even within the same institution.

The tension is palpable. Administrators want consistency, instructors want autonomy, and students want clarity. Without institutional leadership, the result is a patchwork of ad hoc rules that confuse everyone involved.

Does AI Help or Hurt Students Who Already Struggle Academically?

AI tools create a deceptive illusion of competence for struggling students, masking foundational gaps until exams make them visible. Research from UC Berkeley’s computer science department found that students who relied heavily on AI assistants saw their failing grades increase by up to 31% compared to peers who used AI sparingly or not at all.

The mechanism is straightforward. When a student prompts ChatGPT to solve a calculus problem or debug code, they receive a correct answer without understanding the process. This bypasses the productive struggle that builds genuine comprehension. During take-home assignments, these students appear proficient. During proctored exams without AI access, their performance collapses.

Ironically, stronger students often benefit more from AI because they already possess the conceptual framework to evaluate and learn from generated outputs. They can spot errors, ask better follow-up questions, and integrate AI suggestions into existing mental models. Weaker students lack this foundation, making them more likely to accept wrong answers uncritically.

The Antyweb source highlights that treating AI purely as a shortcut disproportionately harms those who can least afford it — students already behind in foundational skills. Without structured guidance, AI becomes a crutch rather than a tutor, widening the achievement gap rather than closing it.

What Do Young People Actually Think About Learning Without AI?

Most students under 25 cannot imagine studying without AI tools, viewing them as essential as search engines were to previous generations. A survey cited by Pulshr found that over 70% of young learners consider AI indispensable for organizing work, analyzing data, and completing assignments on time.

This is not laziness. Students describe AI as a productivity multiplier that helps them manage overwhelming workloads across multiple courses. Many report using ChatGPT or Claude to break down complex topics, generate study outlines, and practice foreign languages. For them, removing AI from the learning process feels like being asked to write essays by hand — technically possible but pointlessly inefficient.

At the same time, students express anxiety about becoming dependent. Some worry they are losing the ability to think through problems independently. Others describe a nagging feeling that their degrees will be devalued if employers learn they relied on AI throughout university. The Pulshr report notes that young people want guidance on responsible AI use, not blanket prohibitions that feel disconnected from reality.

Teachers, meanwhile, are scrambling to keep pace. Many acknowledge that AI is here to stay but feel unprepared to integrate it meaningfully into their teaching. The generational divide is stark — students see AI as normal, while many instructors still view it as a threat to be contained.

How Can Educators Redesign Assignments to Work With AI Instead of Against It?

Effective assignment redesign starts from a uncomfortable premise: any take-home work can and will be completed with AI assistance. The Academia PAN analysis recommends shifting from product-based grading to process-based evaluation, where students document their thinking, justify their approaches, and reflect on mistakes.

Several concrete strategies have emerged from early experiments across universities:

Oral examinations that require students to explain their reasoning in real time, making AI dependence immediately visible
In-class writing sessions where students produce work under supervision without digital tools
Collaborative problem-solving where groups work together with AI tools available, then present and defend their solutions
Process portfolios that track iterative drafts, showing how understanding evolved over weeks
AI-transparency policies where students must disclose which tools they used and how, graded on honesty rather than avoidance
Scaffolded assignments breaking complex tasks into smaller steps, each requiring demonstrated competency before proceeding
Peer review cycles where students evaluate each other’s work, developing critical judgment skills
Reflective journals asking students to articulate what they learned, not just what they produced

Strategy	AI-Resistant?	Scalability	Student Reception
Oral exams	High	Low	Mixed
In-class writing	High	Medium	Positive
Process portfolios	Medium	High	Positive
AI-transparency	Low	High	Very positive
Peer review	Medium	High	Positive

The key insight from the Academia PAN report is that fighting AI is a losing battle. Assignments should teach students to work with AI critically, not pretend it does not exist. This means designing tasks where AI provides partial help but genuine understanding remains necessary for success.

What Does Academic Integrity Mean When AI Can Generate Any Essay?

Academic integrity is being fundamentally redefined, moving away from originality as sole metric toward transparency and demonstrated understanding. The concept of plagiarism itself is shifting — AI-generated content is not copied from another source, yet it is not the student’s own work either. This gray zone exposes the inadequacy of existing frameworks.

According to the Vietnam.vn analysis, plagiarism in the AI era is no longer a simple copy-paste phenomenon. Students can prompt ChatGPT to produce unique, well-structured essays that pass every traditional plagiarism checker. Turnitin and similar tools were designed to detect matching text, not to determine whether a human actually composed the thoughts. This creates an enforcement vacuum.

Some institutions are responding by expanding honor codes to explicitly address AI usage. These updated codes typically require students to disclose when and how they used generative AI tools. The focus shifts from “did you copy” to “can you explain what you submitted.” If a student cannot articulate the reasoning behind their own essay, that becomes the integrity violation — regardless of who or what wrote the initial draft.

The deeper philosophical question remains unresolved. If a student uses AI to generate an essay, then edits, fact-checks, and restructures it substantially, is the final product theirs? Most academics say yes — but only if the student genuinely understands every claim made. Integrity, in this new model, lives in comprehension rather than composition.

Frequently Asked Questions

Are students who use AI more likely to fail courses?

Data from UC Berkeley’s CS courses shows that heavy AI users experienced a 31% increase in failing grades compared to light users, particularly in subjects requiring mathematical reasoning. Students who relied on AI for homework solutions consistently underperformed on proctored exams where AI tools were unavailable, revealing gaps in foundational understanding.

Can universities detect AI-generated student work reliably?

Current AI detection tools remain unreliable for academic purposes, with false positive rates high enough to risk accusing innocent students. The Vietnam.vn source notes that AI-generated plagiarism no longer resembles traditional copy-paste, making conventional plagiarism checkers largely ineffective against well-prompted AI outputs.

How does AI specifically affect math and computer science grades?

In Berkeley’s computer science program, students who used AI assistants for coding assignments showed declining ability to solve novel problems independently, with math-specific skills deteriorating most noticeably. The Antyweb analysis confirms that AI acts as a shortcut that bypasses the productive struggle essential for building quantitative reasoning skills.

What assessment methods work best in the age of generative AI?

The Academia PAN report recommends process-based evaluation methods including oral examinations, in-class supervised work, and documented iterative drafts that show student thinking over time. These approaches focus on demonstrated understanding rather than polished final products that could be AI-generated.

Summary

The relationship between AI tools and student performance is more complicated than simple narratives suggest:

Failing grades correlate with heavy AI reliance, particularly in math-intensive courses where foundational understanding cannot be faked during proctored exams
Assessment methods must evolve from product-based grading to process-based evaluation that values demonstrated comprehension over polished outputs
Students view AI as essential, not optional, making blanket prohibition unrealistic and counterproductive
Academic integrity needs redefinition around transparency and understanding rather than originality alone
Universities are responding too slowly, with most institutions lacking coherent AI policies or faculty training programs

The schools that figure this out first will produce graduates who can actually think alongside machines rather than depend on them. Read the full analysis on AI in education and subscribe for weekly updates on technology’s impact on learning.