CSV Format Requirements

Base Required Columns (All Modes)

ColumnTypeDescription
turn_idintegerUnique identifier for each utterance
speakerstringSpeaker identifier (e.g., "Teacher", "S1")
startfloatStart time in seconds
endfloatEnd time in seconds
utterancestringThe spoken text

Mode-Specific Columns

ColumnModeDescription
context_idContextGroups turns into contexts
targetContext1 or true for the turn to annotate
pair_idPairGroups 2 turns for comparison
batch_idBatchGroups turns into batches

Example CSV (All Columns)

A single CSV file can include all mode-specific columns, allowing you to use the same file with different annotation modes:

turn_id,speaker,start,end,utterance,context_id,target,pair_id,batch_id
1,Teacher,0.0,2.5,"Hello class",1,,1,batch_001
2,Teacher,3.0,5.2,"Today's topic",1,1,1,batch_001
3,Student,6.1,8.9,"I have a question",2,,2,batch_001
4,Teacher,9.0,11.0,"Yes, go ahead",2,1,2,batch_002

Tips

  • Time values (start, end) should be in seconds with decimal precision
  • Use double quotes around utterances containing commas
  • Empty cells are allowed for mode-specific columns you don't need
  • The target column accepts: 1, true, TRUE, or empty
  • Keep CSV files under 10 MB for best performance