ShareChat
Moj

3MASSIV

Moj
X
ShareChat
X

A large scale multilingual, multiaspect and multimodal dataset of expertly-annotated short videos from Moj.

3MASSIV is a human-annotated, culturally diverse, multimodal and multilingual dataset containing short videos uploaded to short video application - Moj.

34 popular social media concepts

Comedy

Romance

Pranks

Fails

Memes

Food

Dance

Music

Fashion

... and many more !

11 Indic Languages

ShareChat in Hindi
ShareChat in Marathi
ShareChat in Gujarati
ShareChat in Punjabi
ShareChat in Telugu
ShareChat in Malayalam
ShareChat in Tamil
ShareChat in Bengali
ShareChat in Kannada
ShareChat in Bhojpuri
ShareChat in Haryanvi

50K labelled and 100K unlabelled videos short videos (~10-20 seconds)

Diverse Video Types

play

Reaction Videos

play

Split Screens

play

Self-Shot Videos

play

Animations

play

Movie/TV-Shows Clips

Diverse Audio Types

play

Self-sung songs

play

Lip-syncs

play

Dialogues

play

Monologues

play

Background Music

Human Annotations

Every video has been annotated by three expert reviewers with each language having dedicated reviewers.

3MASSIV is very rich in annotations and can be used for exploring various research directions:

Concept Understanding
Affective Computing
Language Prediction
Media Format Identification
Temporal Analysis

Team

License

3MASSIV dataset is available for only reseach purposes. Any commercial use of the dataset is strictly prohibited.

ShareChat
Moj

Follow Us

Copyright © 2025 Mohalla Tech Private Limited.