Module 3 of 6

AI me te raupatu matihiko

AI and digital extraction — tracing the colonial logic inside machine learning.

Tohu o tēnei akoranga — Module symbol
Niho taniwha

taniwha teeth — resistance, warrior strength against extraction

This module carries niho taniwha — the teeth of the taniwha. The taniwha is not simply a monster; it is a guardian of boundaries. When data is extracted without consent, the taniwha teeth are the challenge: who has the right to cross this threshold?

He kupu whakataki — Introduction

Raupatu — dispossession — is one of the defining words of Māori history. It describes the large-scale confiscation of Māori land under the New Zealand Settlements Act 1863. Today, a new form of raupatu is underway: the extraction of Māori data, cultural knowledge, and linguistic heritage by AI systems built and owned by corporations with no relationship to Māori communities.

This module examines the structural logic of AI development and asks: in what ways does the current AI economy reproduce the extractive patterns of colonialism?

"The model is trained on your words, your stories, your knowledge. The profit goes to Silicon Valley. This is colonialism with better graphics."

How AI systems extract cultural data

Large language models (LLMs) like GPT-4 and its successors are trained on massive datasets assembled from the internet. These datasets include vast quantities of te reo Māori text, Indigenous knowledge, oral traditions transcribed and published online, and community-generated content — all gathered without consent, and all used to build commercial products.

The myth of "public data"

AI companies defend these practices by pointing to the "public" status of the data they collect. This argument fails on multiple grounds:

"Just because you can take something does not mean you have the right to take it. Māori knew this long before intellectual property law existed."

AI, power, and political economy

The AI economy is highly concentrated. A small number of US corporations — OpenAI (backed by Microsoft), Google, Meta, Amazon — dominate foundational AI development. These corporations operate under US law, answer to US shareholders, and are not subject to tikanga Māori, Treaty obligations, or New Zealand sovereignty claims over Māori cultural data.

This concentration of AI capability mirrors the concentration of colonial land ownership in the 19th century. The mechanisms differ, but the underlying dynamic — the capture of collective resources by private actors — is structurally similar.

Key concepts

Pātai — Discussion questions

  1. Is the training of AI models on publicly available Māori language data a form of raupatu? Defend your answer.
  2. How does the concentration of AI development in US corporations create particular risks for Indigenous communities globally?
  3. What practical mechanisms could give Māori communities meaningful control over how their data is used in AI training?
  4. Is there a version of AI development that would be acceptable from a Māori data sovereignty perspective? What would it look like?

AI and Digital Extraction — Social toolkit

Activity — Raupatu audit

Name one AI product that uses indigenous data without consent. Research it: who built it, what data did it train on, who profits, who loses? Write 200 words or present it as a thread.

AI companies are training on Māori voices, faces, waiata, and stories without consent. Different century, same logic as the land wars. He raupatu tonu tēnei. Module 3 👉 [link] #AIActivism #MāoriDataSovereignty #Raupatu

The taniwha's teeth are at the threshold. Every time an AI trains on Māori data without consent, that's a bite. He raupatu tonu tēnei. Free course. Link in bio. #KiwiDialectic #AIActivism

1/ AI is the latest form of raupatu. Let me show you what I mean. 🧵 2/ Raupatu = confiscation. 1860s New Zealand: the Crown took Māori land without consent or compensation. Millions of acres. Entire tribal territories. It was legal. It was still theft. 3/ In 2024: AI companies take Māori language data, oral recordings, faces, stories, and waiata to train their models. Without consent. Without compensation. It's legal. It's still theft. 4/ The mechanism is different — data pipelines instead of the Native Lands Court. But the structure is identical: extract value from indigenous people, privatise the profit, leave the community with nothing. 5/ The resistance looks different too: Te Pā Tūwatawata, Te Hiku Media's kaitiakitanga AI policy, Te Mana Raraunga. Building the palisades. Free course 👉 [link] #AIActivism #Raupatu #MāoriDataSovereignty

↓ Download campaign image Use as post image on any platform · Niho Taniwha motif

Hashtag bank — copy as needed

#KiwiDialectic #MāoriDataSovereignty #TinoRangatiratanga #AIActivism #TeReoMāori #KaupapaMāori #TePāTūwatawata #IndigenousRights #Aotearoa #DataSovereignty #SovereignFutures #Niho Taniwha