Data Delivery Structure
Data Overview
This is a practical summary of what you actually receive in a PoC pack: scenes, metadata, QC signals, and delivery structure. Start small, validate fit, then scale.
Review real-world failure scenarios before committing to a larger scope.
Includes sample video clips, JSON metadata, and schema preview.
No payment. No commitment. Reply within 24 hours.
Goal: reduce integration friction. You should be able to run your pipeline end-to-end within days, not weeks.
What you get in a PoC pack
- MP4 clips (short segments suitable for fast iteration)
- JSON metadata per clip (scene type + operational notes)
- High-level QC summary (accept/reject signals and reasons)
- Coverage notes (what is included, what is excluded)
- Non-binding delivery structure preview for validation
We keep the format simple: video + metadata. You can add labels later if your team wants labeling or evaluation tasks.
Download full documentation: Technical Summary (PDF) ↓ · License Summary (PDF) ↓
Exactly what you receive
A practical breakdown of what is included in a PoC delivery, what can be added after validation, and what is not included by default.
Included in PoC
- MP4 sample sequences
- Segment-level JSON metadata
- QC summary
- Schema preview / manifest
Optional Add-ons
- Annotation / labeling
- Custom scenario expansion
- Custom delivery format
Not Included
- Exclusive rights
- Resale rights
- Full production licensing
Dataset snapshot
120+
hours of raw footage
18,000+
video segments
4
cities
3
countries
Coverage is continuously expanding as new collection cycles are completed.
Coverage Table
A simplified snapshot of current collection coverage. Detailed breakdowns and sample packs are shared during qualification.
Swipe horizontally to view full coverage details.
| City / Country | Collection Window | Primary Scenarios | Day / Night | Weather | Traffic Density | Segment / Footage Scale |
|---|---|---|---|---|---|---|
| Dhaka, Bangladesh | Q4 2025 – Q1 2026 | Urban intersections, mixed traffic | Mostly Day, Partial Night | Clear, Overcast, Light Rain | Medium to Heavy | 10,000+ segments |
| Chattogram, Bangladesh | Q4 2025 – Q1 2026 | Dense mixed vehicle flow | Day | Humid, Overcast | Medium to Heavy | Growing weekly |
| Secondary Urban Coverage | Rolling collection | Market streets, narrow roads | Day / Select Night | Variable | Medium | Available on request |
Representative public snapshot. Detailed breakdowns available during buyer qualification.
Coverage at a glance
Our core focus is high-entropy traffic environments and edge-like everyday reality: mixed agents, dense interactions, and non-lane-based flow.
Representative scene categories
- Unstructured urban flow (mixed agents, non-lane-based)
- Dense intersections and merges
- Near-field pedestrian interactions
- Motorcycle-heavy corridors
- Night, rain, glare, occlusion-heavy moments
Need a specific city pattern or traffic behavior? Ask for a Coverage Snapshot and we will reply with what is feasible and how fast.
Dataset schema overview
Each delivery includes structured video segments and consistent metadata so your team can filter, audit, and integrate quickly. The schema is intentionally simple for PoC speed while remaining stable enough for larger training pipelines.
Core fields
- segment_id — stable ID used for joins and traceability
- scene_type — high-level environment category
- short_description — short natural language description
- capture_context — optional context such as time_of_day or platform
- qc_status — basic QC result for the segment
- qc_notes — brief QC reviewer notes
Example segment metadata (illustrative)
{
"segment_id": "IN-BLR-2026-02-000123",
"scene_type": "unstructured_urban_flow",
"short_description": "Dense mixed traffic with frequent cut-ins and near-field pedestrians.",
"capture_context": {
"time_of_day": "day",
"weather": "clear",
"platform": "two_wheeler"
},
"qc_status": "pass",
"qc_notes": "Stable exposure, minimal obstruction, continuous forward view."
}
This example illustrates the structure. Final schema definitions and allowed values are shared during the PoC stage so your pipeline can validate exact keys.
Quality control summary
Quality control focuses on technical usability rather than visual perfection. Each segment is checked for video integrity, basic visibility, continuity, and metadata completeness to ensure the data can be reliably ingested into training pipelines.
QC checks (overview)
- Video integrity — corruption, missing frames, encoding errors
- Exposure and visibility — severe blur or unusable lighting
- Obstruction — camera blocked or heavy handling artifacts
- Segment continuity — stable forward motion within the clip
- Metadata completeness — required fields present
For PoC deliveries, QC remains lightweight and transparent. For larger production programs, QC can be extended with additional checks and sampling.
QC Structure, Metadata Preview, and Integration Notes
This section is designed for technical buyers who want to review filtering principles, example scenario structure, metadata format, and a simple ingestion path before requesting a PoC.
QC filtering principles
- Technical usability first — segments are reviewed for visibility, continuity, and ingestion readiness.
- Low-information segments — repetitive or operationally weak clips are excluded from delivery priority.
- High-entropy interactions — dense mixed traffic, ambiguity, and near-field interactions are prioritized.
- Metadata consistency — required keys must be present for segment-level delivery.
- Delivery rule — rejected segments are excluded from delivery datasets.
For PoC, the goal is transparent filtering structure rather than overengineered scoring. Expanded QC logic can be introduced for larger production programs.
Scenario categories and distribution logic
- Unsignalized intersections
- Mixed-agent urban flow
- Pedestrian negotiation zones
- Motorbike-heavy corridors
- Occlusion, glare, rain, and dense compression scenes
Distribution snapshot is illustrative. The collection strategy is intentionally biased toward scenarios that are more useful for evaluation, failure analysis, and hard-case review.
Sample metadata JSON
{
"segment_id": "BD-DHK-2026-000182",
"file_name": "BD-DHK-2026-000182.mp4",
"duration_sec": 42,
"scene_type": "unsignalized_intersection",
"short_description": "Dense mixed traffic with pedestrian negotiation near junction entry.",
"capture_context": {
"time_of_day": "day",
"weather": "overcast",
"platform": "two_wheeler"
},
"qc_status": "pass",
"qc_notes": "Continuous forward view, acceptable visibility, stable enough for PoC ingestion."
}
Preview structure or test integration using a real sample file.
Use this file to test parsing, mapping, and internal evaluation workflows before requesting a full sample pack.
Integration hint
- Delivery format — MP4 clips + JSON metadata
- File naming — one segment ID maps to one video file and one metadata entry
- Clip-to-metadata mapping — use
segment_idas the stable join key - Schema simplicity — lightweight fields for fast PoC validation
Typical ingestion flow
- Download sample package
- Parse JSON metadata
- Map
segment_idto clip filename - Run ingestion and compatibility checks
- Expand coverage after validation
The objective is to reduce internal review friction and make technical validation possible without requiring custom tooling first.
How to evaluate with a PoC
Start with a small PoC dataset, validate integration with your pipeline, then expand coverage once the data proves useful for your model training.
Recommended evaluation steps
- Run your ingestion pipeline (MP4 + JSON)
- Verify metadata parsing and schema compatibility
- Filter by scene_type and inspect environment diversity
- Run a small training or evaluation experiment
- Decide next data scale and scene coverage targets
Get a sample plan tailored to your evaluation setup.
We will respond with a practical data plan based on your target environment, licensing constraints, and evaluation workflow.
FAQ
Can you provide labels?
Yes, but PoC usually starts with video + metadata first. After you validate value, we can add labeling or evaluation tasks based on your needs.
Do you support custom scene requests?
Yes. Send a short requirement list and we will reply with feasibility, timeline, and minimum order assumptions.
What about legal and privacy?
Understand how data is collected, structured, and used during PoC and evaluation.
See the Legal and Transparency page for our approach and PoC usage framing.
No sensitive personal data. Clear usage scope defined for PoC evaluation.
Ready to validate in your pipeline?
Tell us your target environment and failure cases. We will recommend a PoC tier and a coverage plan.
Start with a small sample before scaling to a larger dataset.
No payment. No commitment. Reply within 24 hours.