Crowdsourcing datasets

For low-resource languages across Africa.
Fueling innovation tailored to local needs!

tab
phone
Our Approach

Crowdsourcing

icon

Collecting data from local annotators, ensuring cultural accuracy and diverse datasets.

Social impact and fair Wages

icon

Creating micro-work opportunities and ensuring fair wages for youth and Women.

Hybrid Labeling

icon

Combining human and automated labeling for accurate datasets.

Segmented Data

icon

Creating distinct training, validation, and test sets for LLMs, enhancing adaptability across sectors.

Data Privacy & Ethics

icon

Ensuring ethical dataset through legal compliance and data ownership respect.

Marketplace

icon

Providing a platform for companies to purchase datasets, facilitating access to high-quality data.

Why Leyu?
 

Inclusive datasets

We move beyond generic datasets, offering accurate data in Amharic, Oromiffa, and Tigrigna.

 

Empowers local annotators

Through a crowdsourcing platform, Leyu empowers local annotators, especially women and youth to contribute their voices particularly in AI.

 

Ethical data use

We champion ethical data use and fair compensation to fuel local innovation designed for local challenges.

 

Language

Preserving and amplifying Ethiopian voices by collecting data in their native languages, ensuring their voices are heard and reflected in dataset development.

What We Do
what we do
Our Values

Data Quality and Accuracy

icon

Diverse and high-quality data for reliable AI performance.

Accessibility

icon

Accessible and cost-effective solutions for various sectors.

Impact

icon

Solutions addressing societal challenges in agriculture, health, education and beyond.

Collaboration and Partnership

icon

Sector-specific language datasets for key industries like agriculture, health, and education and beyond.

partner_imagepartner_image

Join us in bridging the AI gap and driving meaningful change in Ethiopia and across Africa.

Get In Touch