Feeds:
Posts
Comments

Posts Tagged ‘ChatGPT’

Photo: Patrick Hendry/Unsplash.
AI can translate the individual words, and perhaps common phrases, but without nuance, all you get is fog.

I worry a lot about AI in education, especially as, in my volunteer capacity teaching English as a second language, I see repeatedly how it interferes with learning. How to Copy and Paste — that is what students are learning, I fear.

At the Conversation, Gareth Barkin, professor of Anthropology and Asian Studies at the University of Puget Sound, explains why students may not even get an accurate translation when they use AI.

“A friend in Indonesia recently told me about a conversation he had with ChatGPT,” he writes. “He had typed a question in Indonesian – Bahasa Indonesia – about how to handle a difficult family dispute. The chatbot responded fluently, in perfect Indonesian, with advice about communication strategies and conflict resolution. The grammar was flawless. The tone was appropriate. And yet something felt off.

“What the AI offered was advice rooted in American cultural assumptions: prioritize your own preferences, communicate directly, and if family members don’t respect your boundaries, consider cutting them off.

“The response was in Indonesian but shaped by values that centered individual autonomy over the consensus-building, social harmony and collective family dynamics that tend to matter more in Indonesian social life.

“My friend was skeptical enough to notice the mismatch. … Many users might not. That is what prompted my research, published in the International Review of Modern Sociology, into a pattern I found across major AI systems: Even when they were fluent in several languages, the language models retained their Western worldview. I call this ‘epistemological persistence.’ …

“Large language models – LLMs – like ChatGPT, Claude and Gemini can now speak dozens of languages with remarkable fluency. That fluency creates the impression that AI understands local cultures.

“Producing grammatically correct Indonesian, Arabic, Swahili or Hindi, however, does not change the underlying worldview through which these systems reason. It does not alter how they think about people, relationships, responsibility or what counts as a good outcome.

“Those assumptions are shaped by training data drawn predominantly from English-language sources based in the United States. Meta’s open-weight model LLaMA 2 was trained on approximately 89.7% English-language textLLaMA 3 includes only about 5% non-English data. Major commercial models don’t publish equivalent breakdowns but draw heavily on the same sources. Arabic, the fifth-most-spoken language globally, accounts for under 1% of content in large training datasets. Languages with tens of millions of speakers, including Bengali and Hausa, barely appear.

“Beneath the surface of these multilingual conversations, English functions as a hidden intermediary. A study by researchers at the University of Oxford found that LLMs routinely conduct their core reasoning in English, even when prompted in other languages. They translate the output at the final stage. A user receives flawless text in their preferred language, but the underlying logic originates elsewhere.

“To examine how this plays out in practice, I ran experiments with ChatGPT, Claude and Gemini. I asked questions in both English and Indonesian about concepts such as education, responsibility, well-being and several Indonesian terms that resist direct translation into English. These included terms such as ‘gotong royong,’ which describes a tradition of communal mutual assistance.

“Then I asked questions about education in both languages, using the word ‘pendidikan’ in Indonesian. The answers were consistently centered on individual development, personal autonomy, critical thinking and preparation for the labor market.

“What largely disappeared were the dimensions of pendidikan that Indonesian educational traditions have historically emphasized. In Indonesia, education has long been focused on ethical discipline. …

“The Indonesian concept of ‘malu’ offers a starker example. Often translated as ‘shame’ or ’embarrassment,’ malu has been analyzed by anthropologists Clifford Geertz and Tom Boellstorff as something closer to a shared social awareness.

“A person might feel malu when speaking out of turn in front of elders, or when a family member’s behavior reflects poorly on the household. It regulates conduct and signals awareness of one’s position within a web of relationships. It is cultivated, not merely felt. It is a form of relational awareness rather than a private psychological event.

“When asked directly to define malu, the models acknowledged its social dimensions. In scenario-based questions that simply used the word without asking for a definition, however, all three fell back on the English translation of shame, consistently framing it as an individual emotional experience.

“One representative response framed malu as a normal emotional reaction to be managed through self-reflection and confidence-building – a personal psychological problem rather than a social one. The relational dimensions of the concept disappeared entirely, replaced by the language of individual emotional regulation. A distinctly American worldview travels inside the translation, largely unannounced. …

“As media scholar Safiya Umoja Noble argues about algorithmic systems more broadly, what looks like a technical outcome is actually a structural one, shaped by who has the wealth and infrastructure to build these systems. …

“The main exceptions are Chinese models such as DeepSeek and Alibaba’s Qwen. They represent a genuine alternative to the U.S.-dominated pipeline, though research shows they operate through a distinctly Chinese cultural lens. Asked about a workplace disagreement, for instance, they tend to advise silence or indirect phrasing to preserve harmony rather than the direct, private correction that Western models recommend.”

More at the Conversation, here. I wonder if any of you who speak different languages have run into any of these translation issues.

Read Full Post »

Photo: MinnPost.
Cynthia Tu of Sahan Journal is using Chat GPT to improve revenue streams.

A few times in the past, I’ve had reason to link to a story at Sahan Journal, a nonprofit newsroom serving immigrants and communities of color in Minnesota. Now NiemanLab, a website about journalism, links to an article on a surprising development at the small publisher.

Lev Gringauz, reporting at MinnPost via NiemanLab, writes “As journalists around the world experiment with artificial intelligence, many newsrooms have common, often audience-facing, ideas for what to try.

“They range from letting readers talk to chatbots trained on reporting, to turning written stories into audio, creating story summaries and, infamously, generating entire articles using AI — a use case vehemently rejected by many journalists.

“But Sahan Journal, the nonprofit newsroom serving immigrants and communities of color in Minnesota, wanted to try something different.

“ ‘We’re less enthusiastic, more skeptical, about using AI to generate editorial content,’ said Cynthia Tu, Sahan Journal’s data journalist and AI specialist.

“Instead, the outlet has been working on ways to support internal workflows with AI. Now, it’s even testing a custom ChatGPT bot to help pitch Sahan Journal to prospective advertisers and sponsors. …

“While AI has plenty of ethical and technical issues, Tu’s work highlights another important aspect: The intended users — in this case, the Sahan Journal team.

“ ‘A lot of … this experiment is less of a technical challenge,’ Tu said. ‘It’s more like, how do you make [AI] fit in the human system more flawlessly? And how do you train the human to use this tool in a way that it was intended?’

Sahan Journal’s AI experimentation, and Tu’s job, are supported by a partnership between the American Journalism Project, a national nonprofit helping local newsrooms, and ChatGPT creator OpenAI. …

Liam Andrew, technology lead for the AJP’s Project & AI Studio, sees part of his job as helping newsrooms overcome hesitancy around AI. …

“Tu joined Sahan Journal fresh from a Columbia Journalism School master’s program in data journalism. She had played a little with chatbots, but otherwise didn’t have much experience working with AI. …

“For one investigation, Tu used a Google AI tool to process the financial data of charter schools in Minnesota. Thinking about how to save time on backend workflows, Tu then helped Sahan Journal generate story summaries, tailored for Instagram carousels, with ChatGPT. …

“ ‘You need to know what the workflow of the organization looks like…[and how] you push for change within a department when they’ve already been doing [something] for the past five years using a manual or human labor way.’

“That knowledge came in handy when finally tackling Tu’s core AI project: improving Sahan Journal’s revenue.

“The project stemmed from an anonymized database of audience insights, which included demographic information and interests. While an important resource, Sahan Journal’s small revenue team didn’t have the time to figure out how to leverage it. …

” ‘What if AI could feed two birds with one scone? A custom ChatGPT bot could process the audience data and personalize a media kit for clients. But it needed to work without being an extra burden on the revenue staff. …

“The magic of AI chatbots like ChatGPT is that you don’t need to know how to code to use them. Just type in a prompt and get rolling. …

“Less magically, AI chatbots can be hard to keep in line for specific tasks. Designed to be eager helpers, they hallucinate false results and stubbornly twist instructions in an attempt to please.

“Troubleshooting those issues was no simple task for Tu.

“The custom revenue chatbot struggled to keep Tu’s preferred formatting, and hallucinated audience data. The bot would also intermix results from the internet that Tu had not asked for. None of that was ideal for a tool that should work reliably for the revenue team.

“ ‘I was kind of jumping through hoops and telling it multiple times, “Please do not reference anything else on the internet,” ‘ Tu said. …

“Working with chatbots is an exercise in prompt engineering — mostly a trial-and-error process of figuring out what specific instructions will get the preferred result. As Tu said, ‘lazy questions lead to lazy answers.’ … Eventually, Tu settled on a reliable set of prompts.

“The custom chatbot takes about 20 seconds to find relevant data from the audience database — for example, pulling up how much of Sahan Journal’s audience cares about public transportation. Then it creates a summary for a media kit tailored to potential clients.

“The chatbot also double-checks its work by referencing the database again, making sure its output matches reality. And part of the database is shown for users to manually see the chatbot isn’t hallucinating. …

“Earlier this year, Tu introduced the final version of the revenue bot to Sahan Journal’s team. …

“By mid-April, the Sahan Journal revenue team had used the custom chatbot on six sales pitches, with three successfully leading to ads placed on the site. …

“But there’s a larger question hanging over this work: Is it sustainable? In a way, newsroom experiments with AI exist in a bubble.

“ ‘Everything is kind of tied to a grant,’ Tu said, referencing the AJP-OpenAI partnership that supports her work. But grants come and go as donor interests (and financials) change.”

The other unknowns are weighed at NiemanLabs, here.

Read Full Post »