Examples

Audio examples across a range of bioacoustics tasks and datasets. Each example shows the input audio, the prompt used, the model’s prediction, and the gold label.

Species Detection

What are the common names for the species in the audio, if any?
| Prediction | Gold Label | Dataset |
| --- | --- | --- |
| Northern Elephant Seal | Meerkat close call | DCASE |
| Black-throated Green Warbler | Black-throated Green Warbler, Eastern Towhee | ENABirds |
| Kentucky Warbler | Kirtland's Warbler, American Crow | ENABirds |
| Chestnut-capped Brushfinch | None | ENABirds |
| Red-legged Thrush | Red-legged thrush | RFCX |
| Puerto Rican Bullfinch | Puerto Rican bullfinch | RFCX |
| Puerto Rican Coqui | None | RFCX |
| Boreal Chorus Frog | Minke whale | HICEAS |
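
The rows above show two things that matter when comparing a prediction with its gold label: capitalization can differ ("Red-legged Thrush" vs. "Red-legged thrush"), and the gold label may list several species or "None". As a minimal sketch, assuming labels are comma-separated strings and that matching should ignore case, one could score a row as follows. The function names `parse_labels` and `score_detection` are hypothetical, and this is not presented as the scoring procedure behind these examples.

```python
from __future__ import annotations


def parse_labels(text: str) -> set[str]:
    """Split a comma-separated label string into a set of lowercase names.

    Assumption: the literal label "None" means no species is present.
    """
    names = {part.strip().lower() for part in text.split(",")}
    names.discard("")
    names.discard("none")
    return names


def score_detection(prediction: str, gold: str) -> dict[str, float]:
    """Return set-based precision and recall for one example."""
    pred = parse_labels(prediction)
    ref = parse_labels(gold)
    if not pred and not ref:
        # Both sides agree that nothing is present.
        return {"precision": 1.0, "recall": 1.0}
    hits = len(pred & ref)
    # Convention chosen for this sketch: an empty side scores 0.0.
    precision = hits / len(pred) if pred else 0.0
    recall = hits / len(ref) if ref else 0.0
    return {"precision": precision, "recall": recall}
```

Under these assumptions, the second row (one of two gold species predicted) gets precision 1.0 and recall 0.5, while a species predicted against a gold label of "None" gets 0.0 for both.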

Species Identification

What is the common name for the focal species in the audio?
| Prediction | Gold Label | Dataset |
| --- | --- | --- |
| Humpback Whale | Humpback Whale | Watkins |
| Walrus | Walrus | Watkins |
| Greater Yellowlegs | Greater Yellowlegs | CBI |
| Blue-winged Teal | Blue-winged Teal | CBI |
| Mexican Free-tailed Bat | Common Mosquito | HumbugDB |
| Dusky White-eye | Dusky White-eye | Unseen (zero-shot) |
| Brown-throated Sunbird | Fire-tailed Sunbird | Unseen (zero-shot) |

What is the scientific name for the focal species in the audio?

| Prediction | Gold Label | Dataset |
| --- | --- | --- |
| Tauraco fischeri | tauraco fischeri | Unseen (zero-shot) |
| Luscinia cyane | larvivora cyane | Unseen (zero-shot) |
| Nisaetus cirrhatus | Nisaetus philippensis | Unseen (zero-shot) |

Call Type

Which of these, if any, are present? Single pulse gibbon call, Multiple pulse gibbon call, Gibbon duet, None.
| Prediction | Gold Label | Dataset |
| --- | --- | --- |
| Multiple pulse gibbon call | Multiple pulse gibbon call | Hainan Gibbons |
| Multiple pulse gibbon call | Multiple pulse gibbon call | Hainan Gibbons |

What type of vocalization is heard from the focal species in the audio? Answer with 'call' or 'song'.

| Prediction | Gold Label | Dataset |
| --- | --- | --- |
| call | call | -- |
| song | song | -- |
| song | song | -- |

Life Stage

What is the life stage of the focal species in the audio?
| Prediction | Gold Label | Dataset |
| --- | --- | --- |
| Adult | juvenile | -- |
| Adult | adult | -- |

Audio Captioning

Caption the audio, using the common name for any animal species.
| Prediction | Gold Label | Dataset |
| --- | --- | --- |
| New Zealand Bellbird singing. | The common evening song of a Mainland New Zealand Bellbird. | -- |
| The sound of a Squirrel Treefrog. | The sound of Squirrel Treefrog after a rain. | -- |

General Sound Classification

Classify the sound into one of the following categories: dog, rooster, pig, cow, frog, cat, hen, insects, sheep, crow, rain, sea_waves, crackling_fire, crickets, chirping_birds, water_drops, wind, pouring_water, toilet_flush, thunderstorm, crying_baby, sneezing, clapping, breathing, coughing, footsteps, laughing, brushing_teeth, snoring, drinking_sipping, door_wood_knock, mouse_click, keyboard_typing, door_wood_creaks, can_opening, washing_machine, vacuum_cleaner, clock_alarm, clock_tick, glass_breaking, helicopter, chainsaw, siren, car_horn, engine, train, church_bells, airplane, fireworks, hand_saw
| Prediction | Gold Label | Dataset |
| --- | --- | --- |
| chirping_birds | dog | ESC-50 |
| cat | cat | ESC-50 |
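
Because the classification prompt lists a closed set of categories, the prompt can be assembled from that list and the model's answer can be checked against it. The sketch below illustrates this under stated assumptions: the helper names `build_prompt` and `parse_answer` are hypothetical, and the normalization (lowercasing, spaces to underscores) is an assumption about how free-text answers might be mapped to labels such as `chirping_birds`, not a documented step.

```python
from __future__ import annotations

PROMPT_PREFIX = "Classify the sound into one of the following categories: "


def build_prompt(categories: list[str]) -> str:
    """Join the category names into a prompt in the style shown above."""
    return PROMPT_PREFIX + ", ".join(categories)


def parse_answer(answer: str, categories: list[str]) -> str | None:
    """Map a free-text answer to a category, or None if it is not in the set."""
    cleaned = answer.strip().lower().replace(" ", "_")
    return cleaned if cleaned in categories else None
```

Returning `None` for an out-of-set answer keeps invalid outputs distinct from wrong but valid ones, such as `chirping_birds` predicted for `dog`.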

Counting

How many birds are in the audio? Choose between 1, 2, 3 or 4.
| Prediction | Gold Label | Dataset |
| --- | --- | --- |
| 1 | 1 | ZF-NBirds |
| 1 | 3 | ZF-NBirds |