Skip to main content

Text Generation

Text generation covers the tasks where a model produces language rather than just labelling it: writing open-ended text, summarising a longer document, and answering questions. These are the tasks closest to what people picture when they think of modern language models, and they are where African languages fall furthest behind, because generation needs both large, clean text to learn from and careful human data to steer and check it. Benchmarks that test large language models across the continent find a persistent gap between English and African languages on exactly these tasks (AfroBench).

This chapter covers three generation tasks:

The three text-generation tasks and their shared reliance on native-speaker human evaluation

What the three tasks share

The pages that follow differ mainly in what the data looks like and how it is scored. The rest of the pipeline is shared, and it is the pipeline this playbook has already covered. Source and clean the text with care for African scripts and web noise (see Data Collection and Data Quality). Design the annotation task and train native-speaker annotators (see Annotation Design). Assure quality and measure agreement, then release under a clear, community-respecting licence (see Data Governance). The three task pages below concentrate on what is distinctive to each: its definition, its data sources, and its evaluation. One theme runs through all of them: automatic metrics are weak for African-language generation, and native-speaker human evaluation is the dependable measure.

Contributor
@abumafrim

Join the discussion

Spotted an error, have a question, or want to share what worked on a real project? Sign in with GitHub to add your voice — every thread lives in the open, powered by GitHub Discussions.

Loading discussion…