EdTech EfficacyEvidence PackJun 22, 2026, 6:05 AM· 5 min read

How AI Teaching Assistants Are Closing the University Achievement Gap

New empirical studies reveal that AI tutors can significantly boost university grades and reduce performance variability, provided students are taught how to interact with them effectively.

By Factlen Editorial Team

Pedagogical Optimists 40%Interaction Designers 35%Empirical Skeptics 25%
Pedagogical Optimists
View AI as a revolutionary tool to scale 1:1 tutoring, provide 24/7 support, and raise the academic floor.
Interaction Designers
Argue that the benefits of AI depend entirely on how access is structured and how students are taught to prompt.
Empirical Skeptics
Warn that voluntary AI use primarily benefits already-privileged students and can lead to cognitive offloading.

Why this matters

As universities rapidly integrate AI into their curricula, understanding how these tools actually affect learning is critical for students and parents. The data shows that AI can dramatically boost grades and close achievement gaps, but only if students are taught to use it as a reflective tutor rather than an answer key.

The holy grail of education has long been Bloom's 2 Sigma Problem—the 1984 finding that students who receive one-on-one tutoring perform two standard deviations better than those in traditional classrooms. For decades, scaling that level of personalized instruction was economically impossible. But the rapid maturation of large language models has shifted the landscape, prompting universities to deploy artificial intelligence as bespoke, 24/7 teaching assistants.[7]

The initial wave of generative AI in late 2022 triggered widespread panic over academic integrity, with institutions rushing to ban chatbots. By 2026, the paradigm has completely inverted. Higher education is now actively integrating proprietary AI tutors into core curricula, aiming to close achievement gaps and provide immediate, judgment-free support to thousands of students simultaneously.[4]

The most prominent early adopter was Harvard University's flagship computer science course, CS50. In 2023, the university deployed the "CS50 Duck," an AI-powered chatbot designed to approximate a 1:1 teacher-to-student ratio for its massive on-campus and online enrollment. Rather than handing out code snippets, the bot was fine-tuned to emulate human educators by offering style suggestions, explaining error messages, and guiding students toward solutions.[1]

The results from these early deployments are now yielding rigorous empirical data, and the headline numbers are striking. A 2026 quasi-experimental study published in Education and Information Technologies tracked university students using AI teaching assistants and found a significant boost in academic performance. Students who engaged with the AI scored an average of 9.09 points higher on assessments than their non-using peers.[2]

Empirical data shows that AI teaching assistants can significantly raise the academic floor for mid- and low-performing students.
Empirical data shows that AI teaching assistants can significantly raise the academic floor for mid- and low-performing students.

Perhaps more importantly for institutional equity, the same study recorded a 36.04% reduction in grade variability. The positive impact was most pronounced among low- and mid-performing students, suggesting that AI tutors can effectively raise the academic floor when deployed correctly. This aligns with pilot data from Fort Hays State University, where faculty reported that 24/7 AI availability dramatically improved student performance on subsequent assignments by providing immediate, iterative feedback before final submission.[2][4]

However, the data reveals a critical caveat: the benefits of AI tutoring are entirely dependent on the student's interaction strategy. The Education and Information Technologies study found that students employing "knowledge-reflective" questioning—asking the AI to explain concepts, debate arguments, or clarify misunderstandings—saw massive gains. Conversely, students who used a "copy-pasting" strategy to simply extract answers experienced a non-significant, and sometimes slightly negative, effect on their learning outcomes.[2]

However, the data reveals a critical caveat: the benefits of AI tutoring are entirely dependent on the student's interaction strategy.

This behavioral divergence is at the heart of what educational researchers call the "Five Percent Problem." When AI tools are introduced voluntarily without structured guidance, the benefits tend to accrue almost exclusively to a small subset of highly motivated, often already-privileged learners. These students use the AI to deepen their understanding, while the remaining 95 percent often use it merely to expedite assignment completion, leading to cognitive offloading and reduced long-term retention.[5]

Without pedagogical integration, voluntary AI use tends to benefit only a small fraction of highly motivated learners.
Without pedagogical integration, voluntary AI use tends to benefit only a small fraction of highly motivated learners.

If left unmanaged, this dynamic threatens to exacerbate the very educational inequalities that AI tutors are theorized to solve. Researchers warn that unguided AI use can act as an amplifier of inequality, turbocharging resource-rich students while leaving vulnerable populations behind. To combat this, institutions are being urged to shift from passive availability to proactive, integrated design—ensuring the AI challenges students before it helps them, and nudges them toward active retrieval practice.[5]

The question of how to restrict or grant access to these tools has also yielded surprising results. A 2025 randomized experiment conducted by the WZB Berlin Social Science Center tested different access models on 334 university students preparing for an exam. One group had restricted access, requiring them to read textbook material independently before consulting the AI, while another had unrestricted access from the start.[3]

Counterintuitively, the unrestricted access group significantly outperformed the restricted group by 0.21 standard deviations. Behavioral analysis revealed that unrestricted access allowed students to gradually and seamlessly integrate the AI into their learning flow. In contrast, restricted access induced intensive, disruptive bursts of prompting that broke the students' concentration, contradicting widespread faculty concerns about premature AI reliance.[3]

Counterintuitively, unrestricted access to AI tutors allows students to maintain their learning flow better than restricted access.
Counterintuitively, unrestricted access to AI tutors allows students to maintain their learning flow better than restricted access.

Despite these promising mechanisms, AI is not a guaranteed panacea. A recent randomized controlled field experiment involving 450 undergraduate students tested the efficacy of a retrieval-augmented generation (RAG) AI tutor across both in-person and asynchronous online modalities. Despite high expectations, the researchers found that the generative AI tutor had no statistically significant impact on student interest, self-efficacy, or academic achievement.[6]

These null results serve as a vital reality check for higher education administrators. They suggest that simply purchasing an AI software license and making it available to the student body is insufficient. The technology only moves the needle when it is deeply woven into the pedagogical fabric of the course, accompanied by explicit instruction on prompt engineering and metacognitive reflection.[6][7]

Faculty members are increasingly recognizing this need for structured integration. At some universities, professors are designing assignments where students are required to "teach" the AI, or where the AI is deliberately coded to make mistakes that the students must identify and correct. This role-reversal forces active engagement and prevents the passive consumption of AI-generated text.[4]

The most successful AI deployments occur when the technology is deeply woven into the pedagogical fabric of the course.
The most successful AI deployments occur when the technology is deeply woven into the pedagogical fabric of the course.

Ultimately, the evidence suggests that AI teaching assistants represent a profound shift in higher education, but one that requires careful architectural oversight. The technology possesses the raw capability to democratize the 1:1 tutoring experience and close achievement gaps for struggling students. Yet, realizing that potential demands a transition from viewing AI as an autonomous oracle to treating it as a collaborative instrument that requires deliberate, taught mastery.[2][5][7]

Viewpoints in depth

Pedagogical Optimists

Proponents argue that AI finally solves Bloom's 2 Sigma Problem by providing every student with a tireless, judgment-free tutor.

This camp points to data showing significant reductions in grade variability and emphasizes that 24/7 availability disproportionately helps non-traditional students who study outside normal office hours. By automating routine explanations and debugging, AI frees up human professors to engage in higher-order mentoring and complex problem-solving with their students.

Interaction Designers

This camp emphasizes that AI is not magic; it is an instrument that requires explicit mastery and structured integration.

Interaction designers cite evidence showing that unrestricted access and 'knowledge-reflective' prompting yield massive gains, while poor interaction design leads to cognitive offloading. For them, the focus must be on teaching students how to prompt effectively. They advocate for assignments where students must debate the AI or correct its deliberate mistakes, forcing active engagement rather than passive consumption.

Empirical Skeptics

Skeptics warn that without careful oversight, AI tools risk exacerbating existing educational inequalities.

Pointing to randomized controlled trials that show null results, skeptics warn of the 'Five Percent Problem.' They argue that highly motivated, resourced students will use AI to accelerate their learning, while struggling students may use it as a crutch to bypass critical thinking. If institutions merely purchase software licenses without overhauling their pedagogy, the achievement gap will likely widen.

What we don't know

  • How the long-term use of AI tutors affects students' independent critical thinking and problem-solving skills over a multi-year degree.
  • Whether the grade improvements seen in early pilot studies will persist as the novelty of AI interaction wears off.
  • How to effectively train the 95% of students who default to 'copy-pasting' to adopt 'knowledge-reflective' prompting strategies.

Sources

Source coverage

7 outlets

3 viewpoints surfaced

Pedagogical Optimists 40%Interaction Designers 35%Empirical Skeptics 25%
  1. [1]Harvard UniversityPedagogical Optimists

    Teaching CS50 with AI: Approximating a 1:1 Teacher-to-Student Ratio

    Read on Harvard University
  2. [2]Education and Information TechnologiesInteraction Designers

    Effects of AI teaching assistants on students' learning outcomes: The mediating role of questioning strategies

    Read on Education and Information Technologies
  3. [3]WZB Berlin Social Science CenterInteraction Designers

    How AI Tutoring Affects Learning in Higher Education

    Read on WZB Berlin Social Science Center
  4. [4]EdTech MagazinePedagogical Optimists

    AI Teaching Assistants Improve Student Performance

    Read on EdTech Magazine
  5. [5]Educational ResearchersEmpirical Skeptics

    The Five Percent Problem: Why AI Tutors Might Widen the Achievement Gap

    Read on Educational Researchers
  6. [6]ResearchGateEmpirical Skeptics

    Evaluating the Efficacy of Generative AI Tutors in Higher Education: A Randomized Controlled Trial

    Read on ResearchGate
  7. [7]Factlen Editorial TeamInteraction Designers

    Synthesis by Factlen editorial team

    Read on Factlen Editorial Team
Stay informed

Every angle. Every day.

Get education stories with full source coverage and perspective breakdowns delivered to your inbox.