Welcome back to our ongoing series on AI marking. We’ve touched on the innovative strides this technology has made, especially when it comes to AI-assisted essay marking. Alongside its remarkable benefits, such as the potential to reduce grading costs by approximately 60%, it’s crucial to address the challenges too, including the various types of AI bias.
In the previous blog post, we examined 10 common AI myths and discovered that AI marking has the potential to reduce bias in grading. However, we must guard against the AI inheriting biases present in its training data. Bias in grading can function like invisible ink, often unnoticeable under regular light but glaringly obvious under scrutiny. Although we often picture AI as a neutral, objective tool, it’s essential to understand that AI is a mirror that reflects its creators’ attributes – including their biases.
As we continue to integrate AI more deeply into our daily lives, the issue of AI bias in education, specifically its impact on grading and assessments, must be thoroughly examined and addressed. For AI to realise its true potential of enhancing our world and democratising opportunities, we must ensure that it is not just intelligent but also fair, unbiased, and genuinely representative of the diverse world it serves. So which types of AI bias should we watch out for?
Types Of AI Bias
Artificial intelligence and machine learning algorithms are only as smart as the data they are fed. They’re designed to recognise patterns and learn from them to make future decisions. If the input data reflects human biases, AI can perpetuate them.
Data bias occurs when the training data fed to an AI model is not representative of the environment in which the model will operate. It’s like teaching a child only about apples and expecting them to understand all fruits. For instance, if an AI is trained primarily on sample essays from urban test-takers, it may inadvertently favour language nuances, examples, or perspectives that are typical of urban environments. Thus, rural test-takers, or those discussing topics less familiar to urban experiences, might find their work undervalued or misinterpreted.
Data bias can happen in several ways:
Sampling bias

This occurs when the sample of assignments used for training differs from the full range of assignments the AI system will grade. For instance, if the training data consists primarily of science assignments, the system might not grade history or literature assignments as accurately.
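As a rough illustration, a quick distribution check over subject labels can surface this kind of skew before training ever begins. The data structure and labels here are purely hypothetical:

```python
from collections import Counter

def subject_distribution(training_set):
    """Return the share of each subject label in a training set.

    `training_set` is assumed to be a list of (essay_text, subject)
    pairs; both the structure and the labels are illustrative.
    """
    counts = Counter(subject for _, subject in training_set)
    total = sum(counts.values())
    return {subject: round(n / total, 2) for subject, n in counts.items()}

# A skewed hypothetical training set: overwhelmingly science essays.
sample = ([("...", "science")] * 80
          + [("...", "history")] * 15
          + [("...", "literature")] * 5)
print(subject_distribution(sample))
```

A heavily lopsided result like this is a signal to rebalance the training set before the model inherits the skew.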
Underrepresentation or overrepresentation
If certain assignments or grading styles are underrepresented or overrepresented in the training data, the AI system might not generalise its grading accurately. For instance, if the system is trained mainly on assignments graded by strict teachers, it might grade more harshly than is fair.
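One simple way to spot this is to compare average awarded grades per grader in the training data. A sketch, using invented grader IDs and grades:

```python
from statistics import mean

def grader_means(records):
    """Group grades by grader and compute each grader's mean grade.

    `records` is a hypothetical list of (grader_id, grade) pairs;
    a large spread between graders suggests the training data mixes
    very different grading styles.
    """
    by_grader = {}
    for grader, grade in records:
        by_grader.setdefault(grader, []).append(grade)
    return {grader: round(mean(grades), 1) for grader, grades in by_grader.items()}

records = [("grader_01", 52), ("grader_01", 55),
           ("grader_02", 71), ("grader_02", 68)]
print(grader_means(records))
```

If one grading style dominates the data, the model will tend to reproduce it; reweighting or sampling evenly across graders is one way to compensate.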
Outdated data

If the training data is outdated, the AI system might not adapt to new trends or educational shifts. For example, if the AI is trained on older English literature assignments, it might not correctly grade assignments that analyse contemporary literature.
Confirmation bias

Like humans, AI can also suffer from confirmation bias if its training data leans towards a particular outcome. If an AI grading system is consistently fed essays that favour a specific point of view and grade them highly, the system might learn to favour that perspective, devaluing contrasting viewpoints.
In the context of AI marking, an AI system might show confirmation bias if it tends to grade assignments in a way that confirms the tendencies in the training data. For instance, if the AI system was trained on data where teachers gave higher grades to assignments with longer word counts, the system might continue to favour longer assignments, even when they’re not necessarily of higher quality.
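A pattern like the word-count one can be checked directly: if awarded grades correlate strongly with essay length in the training data, the model may be learning length as a proxy for quality. A minimal sketch with invented numbers:

```python
from statistics import mean, pstdev

def pearson(xs, ys):
    """Pearson correlation between two equal-length numeric sequences."""
    mx, my = mean(xs), mean(ys)
    cov = mean((x - mx) * (y - my) for x, y in zip(xs, ys))
    return cov / (pstdev(xs) * pstdev(ys))

# Hypothetical training records: (word_count, awarded_grade).
word_counts = [300, 450, 600, 750, 900]
grades = [55, 62, 70, 78, 85]
r = pearson(word_counts, grades)
print(round(r, 2))
```

A correlation near 1.0 doesn’t prove the grades were unfair, but it flags that length, rather than quality, may be driving them, and that the training set deserves a closer look.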
Bias in Interpretation
Algorithmic interpretation bias
This occurs if the AI system consistently interprets ambiguous data in a way that reflects the biases in the training data. For instance, if the AI system was trained on data where teachers tended to grade non-native English speakers lower, the system might continue this pattern, even when the quality of work is comparable.
User interpretation bias
This happens when users interpret the grades given by the AI system based on their own biases. For instance, a teacher might trust the AI system’s grading more when it matches their biases and discount it when it doesn’t.
Strategies for Mitigating AI Bias in Grading
As we’ve explored, AI marking offers transformative benefits, from significant cost reductions to streamlined workflows. However, the issue of various types of AI bias remains a critical concern that can undermine the technology’s potential for fair and accurate assessment.
For testing organisations and awarding bodies, particularly those considering the implementation of AI marking, it’s crucial to proactively address this challenge. Whether you’re selecting a third-party provider or using your own marking data to train the model, a comprehensive, multi-faceted approach is essential for mitigating the various types of AI bias. In the following section, we outline a unified guide to help you navigate this complex yet crucial aspect of AI-assisted marking.
Select a Credible Provider: Opt for a provider with a proven track record in ethical AI practices. Scrutinise their transparency reports and ask for case studies that demonstrate how they’ve effectively managed bias in the past. This will serve as a foundational layer if you’re using their base model.
Curate Balanced Data Sets: When using your own training data, ensure it’s comprehensive. Include assignments from diverse subjects, various grading styles, and a wide range of demographic groups. This diversity helps the model learn to evaluate assignments impartially.
Demand Comprehensive Validation: Whether you’re starting with a provider’s model or using your own data, validation is crucial. Use fairness metrics and cross-validation techniques to assess the model’s performance. Ensure that it doesn’t favour any particular group and that its evaluations are consistent across different demographics.
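One of the simplest fairness checks of this kind is to compare mean AI-awarded grades across demographic groups on a held-out validation set. A sketch, with hypothetical group labels and grades:

```python
from statistics import mean

def group_grade_gap(results):
    """Largest difference in mean AI-awarded grade between any two groups.

    `results` is a hypothetical list of (group_label, grade) pairs from
    a validation set; the labels are illustrative only.
    """
    by_group = {}
    for group, grade in results:
        by_group.setdefault(group, []).append(grade)
    means = {group: mean(grades) for group, grades in by_group.items()}
    return max(means.values()) - min(means.values())

validation = [("group_a", 70), ("group_a", 74),
              ("group_b", 69), ("group_b", 73)]
print(group_grade_gap(validation))
```

A small gap on comparable work is reassuring; a large one warrants investigation before the model goes anywhere near live marking. Fuller validation would also control for genuine quality differences between groups rather than comparing raw means alone.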
Review and Update Training Data: Periodically revisit your training data. Look for any emerging patterns of bias and make necessary adjustments. Keep the data updated to reflect current educational standards and the evolving diversity of your candidate pool.
Request Regular Audits: Make it a contractual or internal policy to conduct audits at regular intervals. These audits should scrutinise the model’s performance and flag any biases that may have crept in. If you’re working with a provider, ensure they commit to this level of scrutiny as well.
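A recurring audit can be as simple as double-marking a random sample: have human moderators re-mark a set of AI-graded scripts and track how far the two sets of grades diverge. A minimal sketch, with invented grade pairs:

```python
from statistics import mean

def audit_agreement(pairs):
    """Mean absolute difference between AI grades and human re-marks.

    `pairs` is a hypothetical audit sample of (ai_grade, human_grade)
    tuples; a rising value between audits suggests drift or bias
    creeping in.
    """
    return mean(abs(ai - human) for ai, human in pairs)

audit_sample = [(68, 70), (74, 73), (55, 60), (81, 80)]
print(audit_agreement(audit_sample))
```

Tracking this figure audit over audit gives a concrete, comparable number to hold the system (or the provider) to.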
Educate Assessors: Your human assessors need to understand the AI model’s limitations. Provide training sessions that equip them to critically interpret AI-generated results. This will enable them to make adjustments where the model may fall short, ensuring a more balanced evaluation process.
Engage Regulatory Bodies: Keep abreast of any guidelines or standards set by educational and technological authorities. This ensures that your AI marking system remains compliant, whether you’re using an external model or developing your own.
Inform Candidates: Transparency is key. Make sure candidates know that an AI system is part of the evaluation process. Explain its role, its limitations, and how it’s used in conjunction with human judgement to arrive at a final grade.
By diligently following these steps, testing organisations and awarding bodies can mitigate major types of AI bias and contribute to a more equitable and reliable AI-assisted marking system.
The road to minimising AI bias in education might be complex, but it’s achievable with concerted effort. By implementing these strategies and working together, we can make strides towards fairer and more objective AI grading, taking us one step closer to an education system where every candidate gets the fair chance they deserve.
Stay tuned for our next blog, where we’ll explore how we can overcome the challenge of transparency in AI marking.