TECHNOLOGY

THE TECH STACK

Speech
processing tools

Language Detection

Transcription

Automatically identifies the primary spoken or written language of assets.

Converts audio dialogue into time-coded text for searchable scripts.

Translation

Subtitling

Converts transcripts or text into supported target languages for localization.

Produces synchronized .srt files for multilingual captions and accessibility.

Dubbing

Multilingual Cross-Node Search

Replaces original audio with AI-generated voices in target languages.

Queries and retrieves assets across federated nodes in multiple languages.

Tagging

Text Too Speech

Extracts relevant keywords and descriptive labels from visual content.

Transforms written text into natural-sounding audio with various voices.

Video Editing Tools

Bulk Upload

Provides browser-based trimming and clipping for fast content repurposing.

Submit a csv file containing multiple video, choose the tools to be applied and let the platform do the work.

THE TECH STACK

Content
intelligence

Captioning

Object Detection

Generates descriptive sentences explaining the visual context of media.

Identifies and tracks specific physical objects appearing within frames.

Action Recognition

Moderation

Detects and classifies specific human activities or events in videos.

Flags inappropriate content including NSFW, memes, and disturbing imagery.

Metadata Enrichment

News Categorization

Automatically categorizes textual contents like news and articles.

VIDEO EDITING TOOLS

Manipulate video
resources

Format Conversion

Frame Extraction

Converts original video files into the following standard formats outlined below, for compatibility and enabled asset download: MP4, MOV, MKV, AVI, WMV.

Allows users to select a specific frame from the video timeline to generate high quality still images or GIF previews.

Scaling & Dropping

Watermarking

Provides tools for resizing videos to predefined or free-form dimensions and cropping frames to specific aspect ratios.

Enables the application of custom watermarks. Users can configure the position (corners or center) and scaling factor of the watermark image.

Trimming

Audio Extraction

A timeline-based interface allows users to define start and end points to extract specific clips from a longer recording.

Strips the audio track from video files, providing it as a standalone audio asset (essential for separate speech-to-text processing).

GIF Conversion

GIF Preview

Conversion Converts the video to a series of rolling images.

Creates an animated GIF preview of the video. This capability is also used internally to generate an animated series of thumbnails that are displayed in the assets page as a preview for each video.

USER PROFILES MANAGEMENT

Broadcaster (premium) vs
End user (standard)

The MOSAIC platform offers tailored access levels to ensure

both professional control and seamless content discovery

Broadcaster Access (Full)

Designed for media professionals and content managers. This role grants full operational power, including asset ingestion (upload and bulk import), visibility management (public vs. private), and metadata editing. Broadcasters can trigger AI tools like Text-to-Speech generation, use advanced video editing tools, and have direct download permissions for all assets.

User Profile (Consult & Discovery)

Optimized for end-users and citizens focused on content consumption. This role allows for multilingual search, viewing of public assets, and access to already AI-generated subtitles and summaries. However, to protect intellectual property, «User» profiles typically have restricted direct download capabilities and cannot initiate new AI processing tasks.

THE TECH STACK

Speechprocessing tools

Language Detection

Transcription

Translation

Subtitling

Dubbing

Multilingual Cross-Node Search

Tagging

Text Too Speech

Video Editing Tools

Bulk Upload

THE TECH STACK

Contentintelligence

Captioning

Object Detection

Action Recognition

Moderation

Metadata Enrichment

News Categorization

VIDEO EDITING TOOLS

Manipulate videoresources

Format Conversion

Frame Extraction

Scaling & Dropping

Watermarking

Trimming

Audio Extraction

GIF Conversion

GIF Preview

USER PROFILES MANAGEMENT

Broadcaster (premium) vsEnd user (standard)

User Profile (Consult & Discovery)

USER DASHBOAR

Assets list

SINGLE ASSET PAGE

General information

ANNOTATIONS

Tags, objects, actions andpersons AI-detected

SPEECH PROCESSING TOOLS

Transcription & translation

VIDEO EDITING TOOLS

Edit your contents insideMOSAIC

DEEPFAKE DETECTION

Check your video for AImanipulation / generation

BULK UPLOAD CAPABILITY

Avoid the interface when youhave several videos to upload

Speech
processing tools

Content
intelligence

Manipulate video
resources

Broadcaster (premium) vs
End user (standard)

Tags, objects, actions and
persons AI-detected

Edit your contents inside
MOSAIC

Check your video for AI
manipulation / generation

Avoid the interface when you
have several videos to upload