Muhammad Saghir — Senior Product Designer

Muh.

My Role

UX UI Designer

Responsibilities

Research · Interaction · Visual Design · Testing

Duration

Aug 2020 – Jun 2021

SUS Score

67.6%

Overview

The Challenge with Automated Extraction

Even though there are automated systems to extract information from research papers, we can't confidently verify whether this information is correct or not.

In deep learning, data is the foundation — model performance directly depends on the quality and accuracy of the data you feed in. Yet the manual annotation process that ensures this quality was time-consuming, tedious, and frustratingly error-prone.

"We are moving slowly into an era where big data is the starting point, not the end."

Problem Context

Why Accurate Annotation Matters

Automated systems to extract information from research papers exist, but their accuracy is often questionable. In the context of deep learning, the quality of data directly influences model performance. However, the process of manually annotating data is often time-consuming, tedious, and prone to human error.

Given the importance of accurate data, this process is critical for ensuring that machine learning models perform at their best. The problem wasn't just technical — it was deeply human. We needed to understand the researchers, data scientists, and industry experts who lived inside this process every day.

⚠️

High Error Rate

Manual annotation suffers from inconsistency and human bias

⏱️

Low Efficiency

Tedious workflows drain researcher time and motivation

📉

Model Impact

Poor data quality directly degrades deep learning performance

Discovery

Getting to Know Users

At the start of the project, I supported extensive user research and competitor analysis to familiarize data scientists and project managers with the current annotation products. For me, that stage is crucial — I love to start projects by gathering as much information and data as I can, and analysing them to better understand the problem I'm trying to solve.

Due to the constraints posed by the COVID-19 pandemic, we conducted online surveys and telephone interviews to gather valuable insights into customer needs.

Stakeholder Interviews

Telephone and online interviews with data scientists and project managers to surface their workflows, pain points, and existing tool habits.

Competitor Analysis

Deep dive into existing annotation tools to identify patterns, gaps, and opportunities for differentiated design.

Survey Research

Online surveys to gather quantitative data on user preferences, frequency of annotation, and satisfaction with current tools.

Iterative Testing

Low-fidelity paper sketches evolved into high-fidelity mockups, validated through continuous usability testing cycles.

Personas

Who Are We Designing For?

Based on our research, we identified two primary user archetypes whose needs shaped every design decision throughout the project.

Dr. Sarah Müller

Senior Data Scientist

Age 32 · Research Institute

"I spend more time fixing annotation inconsistencies than actually training models. There has to be a better way."

✓

Goals

Ensure high-quality, consistent annotation data for her ML models

Reduce time spent on manual review and error correction

Have full visibility into annotation progress across the team

Pain Points

Current tools are unintuitive and require technical onboarding

No feedback loop — she can't tell if annotators understood the task

Context switching between papers, tools, and spreadsheets

James Okafor

ML Project Manager

Age 38 · BioTech Startup

"Keeping the team motivated through repetitive annotation tasks is the hardest part of my job. I need a tool that helps with that."

✓

Goals

Maintain team productivity and motivation over long annotation projects

Track contribution and quality across multiple annotator roles

Deliver labeled datasets on time and within budget

Pain Points

No performance tracking — hard to identify top contributors vs. laggards

Annotation quality drops off significantly after the first few weeks

Lack of reward or recognition structures for team effort

Approach

Designing for Delight

Our mission was to reimagine the annotation process for data scientists. Rather than viewing it as a mundane task, we aimed to transform the experience into something that data scientists would enjoy. To achieve this, we worked closely with data scientists to understand their workflows, pain points, and the unique context in which they operate.

This collaboration was challenging but highly rewarding, as it provided us with invaluable insights into how we could improve the tool's usability. By conducting user interviews and iterative testing, we developed an annotation application that was intuitive and efficient.

We focused on creating a streamlined user interface that minimized friction, enhanced productivity, and made the process of annotating research papers feel less like a chore and more like a rewarding task.

MVP Definition

User Roles & Gamification Structure

Based on our sketching sessions and deep collaboration with data scientists, we identified three primary user roles for the gamified annotation application.

🔍

Role 01

Selector

The Selector chooses research papers and categorizes them by Disease, Variant, and Gene, ensuring the papers are correctly assigned for annotation.

✏️

Role 02

Annotator

The Annotator labels relevant information from the papers, extracting key data points and relationships between Disease, Variant, and Gene.

🎯

Role 03

Reviewer

The Reviewer, an industry expert, verifies the accuracy of the data provided by the Selector and Annotator, ensuring high-quality annotations.

Annotation Workflow

Selector

Assigns Papers

→

Annotator

Labels Data

→

Reviewer

Validates Quality

Gamification Features

To motivate users and sustain engagement across long annotation projects, we introduced two core gamification elements:

🏆

Leadership Board

A Leadership Board tracks user points based on their contributions (Selection, Annotation, Review), fostering healthy competition and continuous engagement across the team.

🎁

Earn Rewards

Users earn lunch vouchers as rewards for their efforts in annotation, selection, and review tasks, encouraging ongoing participation and creating a tangible incentive structure.

Design

High Fidelity Mockups

The final tablet designs use a cohesive purple gradient system — transforming annotation from a dry task into an engaging, mission-driven experience across every screen.

Splash & Onboarding

Splash Screen

First-launch experience — brand intro with tagline, key stats, and a single CTA to get started

Authentication Screen

Clean login with Microsoft SSO and Google Workspace — minimal friction entry to the workspace

Dashboard

User Dashboard

Overview of active missions, personal stats, and available tasks to join

Mission Setup

Disease Selection

Pick a disease mystery game — Cystic Fibrosis, Alzheimer's, Breast Cancer and more

Mission Target

Select a specific research target — CFTR Trafficking with player counts and mission details

Role Selection

Choose Your Difficulty Level

Select Selector, Annotator, or Reviewer role — each with distinct skills and difficulty

Core Annotation Flow

Calibration Check

Before annotating — verify you're reading the correct paper with a calibration question

Full Paper Reading

Read the full research paper with highlighted entities before annotating

Assay Method

Structured annotation of assay methods — answer targeted questions about experimental procedures

Protein Activity Annotation

Answer structured questions about protein functions extracted from the paper

Rewards & Leadership Board

Mission Congratulations

Mission complete — earn points, view mission progress summary, and continue to next question

Leadership Board

Ranked leaderboard tracking all contributors — Selector, Annotator, Reviewer points

Validation

Usability Testing

Due to the constraints posed by the COVID-19 pandemic, we conducted online surveys and telephone interviews to gather valuable insights. Using this data, I created low-fidelity mockups through paper sketching, which were then refined into high-fidelity mockups. For usability testing, we utilized the System Usability Scale (SUS) to evaluate and measure the user experience.

The System Usability Scale (SUS)

Participants were asked to score the following 10 items with one of five responses ranging from Strongly Agree to Strongly Disagree:

I think that I would like to use this system frequently.

I found the system unnecessarily complex.

I thought the system was easy to use.

I think that I would need the support of a technical person to be able to use this system.

I found the various functions in this system were well integrated.

I thought there was too much inconsistency in this system.

I would imagine that most people would learn to use this system very quickly.

I found the system very cumbersome to use.

I felt very confident using the system.

I needed to learn a lot of things before I could get going with this system.

System Usability Scale

85.5

/ 100

V1 Research Baseline · Above Average for First Build

Industry Benchmark Scale

Unacceptable

Poor

Good

Excellent

100

85.5

Discovery Clue V1

Average SUS (all products)

Typical annotation tool

67.6 exceeds the average annotation tool benchmark and aligns with the global SUS average — a strong foundation for the next iteration.

Outcome

Impact & Reflection

The result was an application that not only improved the accuracy and efficiency of the annotation process but also encouraged data scientists to engage with it in a more meaningful way. Through a combination of clear design, user-friendly features, and continuous user feedback, we were able to build a tool that enhanced data quality and ultimately contributed to better deep learning model performance.

The application became a key part of the data scientists' workflow, improving their overall satisfaction and productivity. The gamification elements proved particularly effective — the Leadership Board and reward system created a sense of community and achievement that sustained engagement across long annotation sprints.

👥

User Roles Defined

📊

85.5%

SUS Score (V1)

🚀

↑

Team Productivity

UX Case Study

Discovery

Clue

Redesigning data annotation for deep learning — turning a tedious, error-prone workflow into an engaging, gamified mission platform.

UX / UI Design

User Research

Prototyping & Testing

Aug 2020 – Jun 2021

User Roles

67.6

SUS Score

1 Year

Duration

Enter your credentials to access your workspace.

Email / Username

researcher@institution.edu

Password

••••••••••••

Keep me signed in

Forgot password?

Or continue with

Microsoft SSO

Google Workspace

9:41

Mon, Apr 27

79%

Welcome back,

Researcher

Your annotations help accelerate drug target discovery for millions of patients worldwide.

"Discovery Clue changed how our team manages annotation workflows. The quality of data is remarkable."

— Dr. S. Müller, ETH Zürich

9:41

Mon, Apr 27

79%

Discovery Clue

Research Annotation Platform

Disease

Target

Role

Paper

Choose your difficulty level

Cystic Fibrosis → CFTR Trafficking · Step 3 of 4

Selector

Beginner

+50 pts / paper

Review and categorise research papers by Disease, Gene, and Variant type before they enter the annotation queue.

Key Skills

Paper classification

PubMed navigation

Domain literacy

SELECTED

Annotator

Intermediate

+100 pts / paper

Label Disease–Gene–Variant–Assay–Effect relationships within selected paper abstracts and full text.

Key Skills

Variant interpretation

Gene ontology

Structured data entry

Reviewer

Expert

+200 pts / review

Validate and approve annotations from Selectors and Annotators. Final quality gate before data enters the pipeline.

Key Skills

Clinical expertise

Variant curation

Data QC & audit

Back

Continue

9:41

Mon, Apr 27

79%

Discovery Clue

Research Platform

847 pts

Good morning

Amir Karimi 👋

Lv 3 · 1,840 / 2,700 XP

Missions

#12

Rank

Done

1,840

Points

Active Missions

View all →

Cystic Fibrosis

CFTR Trafficking · 3 / 7 steps

+500 pts

Alzheimer's Disease

Tau Propagation · 1 / 5 steps

+300 pts

Breast Cancer

BRCA1 Expression · 2 / 6 steps

+400 pts

Available to Join

2 NEW

Browse all →

Parkinson's Disease

NEW

LRRK2 Pathway · 4 papers · BioRxiv

+300 pts

+ Join

Inflammatory Bowel

NEW

TNF-α Signalling · 4 papers · IJR

€10 voucher

+ Join

9:41

Mon, Apr 27

79%

Disease

Target

Role

Paper

Select your Disease Mystery Game

Your annotation effort directly accelerates drug target identification. Choose the disease area you'd like to support.

Cystic Fibrosis

CF · CFTR · Fibrosis

142 papers

38 genes

Inflammatory & Metabolic Disorders

IMD · Metabolism

89 papers

21 genes

Alzheimer's Disease

AD · Neurodegeneration

214 papers

63 genes

Breast Cancer

BC · Oncology · BRCA

178 papers

54 genes

Type 2 Diabetes

T2D · Metabolic

103 papers

29 genes

Parkinson's Disease

PD · Neurodegeneration

96 papers

31 genes

Back

Continue

9:41

Mon, Apr 27

79%

Discovery Clue

Annotate.

Accelerate.

Discover.

Label Disease–Gene–Variant relationships in medical papers and fuel the next breakthrough in precision medicine.

Get Started

Already a member? Sign In

12K+

Papers

98%

Accuracy

420

Researchers

Architecture

Information Architecture

A structured map of all six platform areas — showing how the app flows from the Homepage hub through each section's screens and role-based sub-states.

Discovery Clue — Navigation Map

Entry / Hub

App Area

Screen

Sub-state

Entry

App Area

Screens

Sub-state

🏠

Homepage

Authentication

Dashboard

Mission Setup

User Roles

Annotation Workspace

Rewards

Onboarding

Active Missions

My Stats & XP

Leadership Preview

New Missions

Disease Selection

Mission Target

Selector

Annotator

Reviewer

Protein Q&A

Full Paper Reading

Change Paper

Receiving XP

Leadership Board

Answers Questions

Muhammad Inzimam Saghir