Shopping Cart

No products in the cart.

BSI 22/30464047 DC 2022

$45.21

BS EN ISO/IEC 14496-12. Information technology. Coding of audio-visual objects – Part 12. ISO base media file format

Published By Publication Date Number of Pages
BSI 2022 329
Guaranteed Safe Checkout
Category:

If you have any questions, feel free to reach out to our online customer service team by clicking on the bottom right corner. We’re here to assist you 24/7.
Email:[email protected]

PDF Catalog

PDF Pages PDF Title
15 1 Scope
2 Normative references
16 3 Terms, definitions and abbreviated terms
3.1 Terms and definitions
3.1.1 access unit
3.1.2 box
17 3.1.3 chunk
3.1.4 clean aperture
3.1.5 container box
3.1.6 EDRAP sample
3.1.7 file level
3.1.8 full aperture
3.1.9 hint track
3.1.10 hinter
3.1.11 index file
3.1.12 ISO base media file
18 3.1.13 item
3.1.14 leading sample
3.1.15 leaf subsegment
3.1.16 mod
3.1.17 media data box
3.1.18 metadata file
3.1.19 movie box
3.1.20 movie file
3.1.21 movie fragment
3.1.22 structure-data
19 3.1.23 movie-fragment relative addressing
3.1.24 open random access point
3.1.25 pixel aspect ratio
3.1.26 preselection
3.1.27 presentation
3.1.28 presentation time
3.1.29 random access
3.1.30 timeline
3.1.31 timescale
20 3.1.32 timestamp
3.1.33 decoding timestamp
3.1.34 composition timestamp
3.1.35 sample duration
3.1.36 random access point
RAP
3.1.37 random access recovery point
3.1.38 sample
3.1.39 sample entry
21 3.1.40 sample entry type
3.1.41 untransformed sample entry type
3.1.42 sample number
3.1.43 sample table
3.1.44 sync sample
3.1.45 segment
3.1.46 segment file
3.1.47 subsegment
3.1.48 thumbnail image
3.1.49 top-level box
22 3.1.50 track
3.1.51 haptic media
3.1.52 volumetric visual media
3.2 Abbreviated terms
4 Object-structured file organization
4.1 File structure
23 4.2 Object structure
4.2.1 Object syntax conventions
4.2.2 Object definitions
25 4.2.3 Extensibility of object definitions
26 4.3 File-type box
4.3.1 Definition
27 4.3.2 Syntax
4.3.3 Semantics
4.4 Extended type box
4.4.1 Definition
28 4.4.2 Syntax
4.4.3 Semantics
5 Structure of this document
29 6 ISO base media file organization
6.1 Files, segments, and streams
6.2 Presentation structure
6.2.1 Object structure of a presentation
30 6.2.2 Meta data and media data
6.2.3 Data offsets
31 6.3 Structure-data (objects)
6.3.1 Box
32 6.3.2 Data types and fields
33 6.3.3 URIs as type indicators
34 6.3.4 Box order
37 6.4 Time structure overview
38 6.5 Identifiers
6.6 Brand identification
6.7 Uniform resource locators (URLs)
7 Streaming support
8 Box structures
8.1 File structure and general boxes
8.1.1 Media data box
8.1.1.1 Definition
39 8.1.1.2 Syntax
8.1.1.3 Semantics
8.1.2 Free space box
8.1.2.1 Definition
8.1.2.2 Syntax
8.1.2.3 Semantics
8.1.3 Progressive download information box
8.1.3.1 Definition
40 8.1.3.2 Syntax
8.1.3.3 Semantics
8.1.4 Identified media data box
8.1.4.1 Definition
8.1.4.2 Syntax
8.1.4.3 Semantics
41 8.2 Movie structure
8.2.1 Movie box
8.2.1.1 Definition
8.2.1.2 Syntax
8.2.2 Movie header box
8.2.2.1 Definition
42 8.2.2.2 Syntax
8.2.2.3 Semantics
43 8.3 Track structure
8.3.1 Track box
8.3.1.1 Definition
8.3.1.2 Syntax
8.3.2 Track header box
8.3.2.1 Definition
45 8.3.2.2 Syntax
8.3.2.3 Semantics
47 8.3.3 Track reference box
8.3.3.1 Definition
8.3.3.2 Syntax
8.3.3.3 Semantics
49 8.3.3.4 Track reference definitions
8.3.3.4.1 Associated external stream track reference
8.3.4 Track group box
8.3.4.1 Definition
50 8.3.4.2 Syntax
8.3.4.3 Semantics
8.3.4.4 Track group definitions
8.3.4.4.1 Multi-source presentation
51 8.3.4.4.2 Stereoscopic pair
8.3.4.4.2.1 Definition
8.3.4.4.2.2 Syntax
8.3.4.4.2.3 Semantics
8.3.4.4.3 Preselection group box
8.3.4.4.3.1 Definition
52 8.3.4.4.3.2 Syntax
8.3.4.4.3.3 Semantics
8.3.4.4.3.4 Preselection processing box
8.3.4.4.3.4.1 Definition
8.3.4.4.3.4.2 Syntax
8.3.4.4.3.4.3 Semantics
53 8.3.5 Track group description box
8.3.5.1 Definitions
8.3.5.2 Syntax
54 8.3.5.3 Semantics
8.3.5.4 Track group entry definitions
8.3.5.4.1 Preselection track group entry box
8.3.5.4.1.1 Definition
8.3.5.4.1.2 Syntax
55 8.3.5.4.1.3 Semantics
56 8.3.6 Track type box
8.3.5.1 Definition
8.3.5.2 Syntax
8.4 Track media structure
8.4.1 Media box
8.4.1.1 Definition
8.4.1.2 Syntax
57 8.4.2 Media header box
8.4.2.1 Definition
8.4.2.2 Syntax
8.4.2.3 Semantics
58 8.4.3 Handler reference box
8.4.3.1 Definition
8.4.3.2 Syntax
8.4.3.3 Semantics
8.4.4 Media information box
8.4.4.1 Definition
59 8.4.4.2 Syntax
8.4.5 Media information header boxes
8.4.5.1 Definition
8.4.5.2 Null media header box
8.4.5.2.1 Definition
8.4.5.2.2 Syntax
8.4.5.2.3 Semantics
8.4.6 Extended language tag
8.4.6.1 Definition
60 8.4.6.2 Syntax
8.4.6.3 Semantics
8.5 Sample tables
8.5.1 Sample table box
8.5.1.1 Definition
61 8.5.1.2 Syntax
8.5.2 Sample description box
8.5.2.1 Definition
62 8.5.2.2 Syntax
8.5.2.3 Semantics
63 8.5.3 Degradation priority box
8.5.3.1 Definition
8.5.3.2 Syntax
8.5.3.3 Semantics
8.5.4 Sample scale box
64 8.6 Track time structures
8.6.1 Time to sample boxes
8.6.1.1 Definition
65 8.6.1.2 Decoding time to sample box
8.6.1.2.1 Definition
66 8.6.1.2.2 Syntax
8.6.1.2.3 Semantics
8.6.1.3 Composition time to sample box
8.6.1.3.1 Definition
68 8.6.1.3.2 Syntax
8.6.1.3.3 Semantics
8.6.1.4 Composition to decode box
8.6.1.4.1 Definition
69 8.6.1.4.2 Syntax
8.6.1.4.3 Semantics
70 8.6.2 Sync sample box
8.6.2.1 Definition
8.6.2.2 Syntax
8.6.2.3 Semantics
8.6.3 Shadow sync
8.6.3.1 Shadow sync support
71 8.6.3.2 Shadow sync sample box
8.6.3.2.1 Definition
72 8.6.3.2.2 Syntax
8.6.3.2.3 Semantics
8.6.4 Independent and disposable samples box
8.6.4.1 Definition
73 8.6.4.2 Syntax
8.6.4.3 Semantics
74 8.6.5 Edit box
8.6.5.1 Definition
8.6.5.2 Syntax
8.6.6 Edit list box
8.6.6.1 Definition
75 8.6.6.2 Syntax
8.6.6.3 Semantics
76 8.6.6.4 Edit list examples
77 8.7 Track data layout structures
8.7.1 Data information box
8.7.1.1 Definition
8.7.1.2 Syntax
8.7.2 Data reference box
8.7.2.1 Definition
78 8.7.2.2 Syntax
79 8.7.2.3 Semantics
8.7.3 Sample size boxes
8.7.3.1 Definition
8.7.3.2 Sample size box
8.7.3.2.1 Syntax
80 8.7.3.2.2 Semantics
8.7.3.3 Compact sample size box
8.7.3.3.1 Syntax
8.7.3.3.2 Semantics
8.7.4 Sample to chunk box
8.7.4.1 Definition
81 8.7.4.2 Syntax
8.7.4.3 Semantics
8.7.5 Chunk offset box
8.7.5.1 Definition
82 8.7.5.2 Syntax
8.7.5.3 Semantics
8.7.6 Padding bits box
8.7.6.1 Definition
8.7.6.2 Syntax
83 8.7.6.3 Semantics
8.7.7 Sub-sample information box
8.7.7.1 Definition
84 8.7.7.2 Syntax
8.7.7.3 Semantics
85 8.7.8 Sample auxiliary information sizes box
8.7.8.1 Definition
86 8.7.8.2 Syntax
8.7.8.3 Semantics
8.7.9 Sample auxiliary information offsets box
8.7.9.1 Definition
87 8.7.9.2 Syntax
8.7.9.3 Semantics
8.8 Movie fragments
8.8.1 Movie extends box
8.8.1.1 Definition
88 8.8.1.2 Syntax
89 8.8.2 Movie extends header box
8.8.2.1 Definition
8.8.2.2 Syntax
8.8.2.3 Semantics
8.8.3 Track extends box
8.8.3.1 Definition
90 8.8.3.2 Syntax
8.8.3.3 Semantics
8.8.4 Movie fragment box
8.8.4.1 Definition
91 8.8.4.2 Syntax
8.8.5 Movie fragment header box
8.8.5.1 Definition
8.8.5.2 Syntax
8.8.5.3 Semantics
8.8.6 Track fragment box
8.8.6.1 Definition
92 8.8.6.2 Syntax
8.8.7 Track fragment header box
8.8.7.1 Definition
94 8.8.7.2 Syntax
8.8.7.3 Semantics
8.8.8 Track fragment run box
8.8.8.1 Definition
95 8.8.8.2 Syntax
96 8.8.8.3 Semantics
8.8.9 Movie fragment random access box
8.8.9.1 Definition
8.8.9.2 Syntax
8.8.10 Track fragment random access box
8.8.10.1 Definition
97 8.8.10.2 Syntax
8.8.10.3 Semantics
98 8.8.11 Movie fragment random access offset box
8.8.11.1 Definition
8.8.11.2 Syntax
8.8.11.3 Semantics
8.8.12 Track fragment decode time box
8.8.12.1 Definition
99 8.8.12.2 Syntax
8.8.12.3 Semantics
8.8.13 Level assignment box
8.8.13.1 Definition
100 8.8.13.2 Syntax
8.8.13.3 Semantics
101 8.8.14 Sample auxiliary information in movie fragments
102 8.8.15 Track Extension Properties box
8.8.15.1 Definition
8.8.15.2 Syntax
8.8.15.3 Semantics
8.8.16 Alternative startup sequence properties box
8.8.16.1 Definition
103 8.8.16.2 Syntax
8.8.16.3 Semantics
8.8.17 Metadata and user data in movie fragments
104 8.9 Sample group structures
8.9.1 Overview
105 8.9.2 Sample to group box
8.9.2.1 Definition
8.9.2.2 Syntax
106 8.9.2.3 Semantics
8.9.3 Sample group description box
8.9.3.1 Definition
107 8.9.3.2 Syntax
108 8.9.3.3 Semantics
109 8.9.4 Representation of group structures in movie fragments
110 8.9.5 Compact sample to group box
8.9.5.1 Definition
111 8.9.5.2 Syntax
8.9.5.3 Semantics
112 8.10 User data
8.10.1 User data box
8.10.1.1 Definition
113 8.10.1.2 Syntax
8.10.2 Copyright box
8.10.2.1 Definition
8.10.2.2 Syntax
8.10.2.3 Semantics
8.10.3 Track selection box
8.10.3.1 Overview
114 8.10.3.2 Definition
8.10.3.3 Syntax
8.10.3.4 Semantics
8.10.3.5 Attributes
115 8.10.4 Track kind
8.10.4.1 Definition
116 8.10.4.2 Syntax
8.10.4.3 Semantics
8.10.5 Label box
8.10.5.1 Definition
8.10.5.2 Syntax
8.10.5.3 Semantics
117 8.11 Metadata support
8.11.1 MetaBox
8.11.1.1 Definition
118 8.11.1.2 Syntax
8.11.2 XML boxes
8.11.2.1 Definition
119 8.11.2.2 Syntax
8.11.2.3 Semantics
8.11.3 Item location box
8.11.3.1 Definition
122 8.11.3.2 Syntax
8.11.3.3 Semantics
123 8.11.4 Primary item box
8.11.4.1 Definition
8.11.4.2 Syntax
8.11.4.3 Semantics
8.11.5 Item protection box
8.11.5.1 Definition
124 8.11.5.2 Syntax
8.11.6 Item information box
8.11.6.1 Definition
125 8.11.6.2 Syntax
126 8.11.6.3 Semantics
127 8.11.7 Additional metadata container box
8.11.8 Metabox Relation box
8.11.9 URL forms for MetaBoxes
128 8.11.10 Static metadata
8.11.10.1 General
8.11.10.2 Simple textual
8.11.10.3 Other forms
8.11.10.4 MPEG-7 metadata
129 8.11.11 Item data box
8.11.11.1 Definition
8.11.11.2 Syntax
8.11.11.3 Semantics
8.11.12 Item reference box
8.11.12.1 Definition
130 8.11.12.2 Syntax
8.11.12.3 Semantics
8.11.13 Auxiliary video metadata
8.11.14 Item properties box
8.11.14.1 Definition
131 8.11.14.2 Syntax
132 8.11.14.3 Semantics
8.11.15 Brand item property
8.11.15.1 Definition
133 8.11.15.2 Syntax
8.11.16 Handler property
8.11.16.1 Definition
8.11.16.2 Syntax
8.11.16.3 Semantics
134 8.12 Support for protected streams
8.12.1 Overview
135 8.12.2 Protection scheme information box
8.12.2.1 Definition
136 8.12.2.2 Syntax
8.12.3 Original format box
8.12.3.1 Definition
8.12.3.2 Syntax
8.12.3.3 Semantics
8.12.4 IPMPInfoBox
8.12.5 IPMP control box
137 8.12.6 Scheme type box
8.12.6.1 Definition
8.12.6.2 Syntax
8.12.6.3 Semantics
8.12.7 Scheme information box
8.12.7.1 Definition
8.12.7.2 Syntax
138 8.12.8 Scramble Scheme Information Box
8.12.8.1 Definition
8.12.8.2 Syntax
8.13 File delivery format support
8.13.1 Overview
139 8.13.2 FD item information box
8.13.2.1 Definition
8.13.2.2 Syntax
8.13.2.3 Semantics
140 8.13.3 File partition box
8.13.3.1 Definition
8.13.3.2 Syntax
8.13.3.3 Semantics
141 8.13.4 FEC reservoir box
8.13.4.1 Definition
142 8.13.4.2 Syntax
8.13.4.3 Semantics
8.13.5 FD session group box
8.13.5.1 Definition
143 8.13.5.2 Syntax
8.13.5.3 Semantics
8.13.6 Group ID to name box
8.13.6.1 Definition
8.13.6.2 Syntax
144 8.13.6.3 Semantics
8.13.7 File reservoir box
8.13.7.1 Definition
8.13.7.2 Syntax
8.13.7.3 Semantics
8.14 Sub tracks
8.14.1 Overview
145 8.14.2 Backward compatibility
8.14.3 Sub track box
8.14.3.1 Definition
8.14.3.2 Syntax
146 8.14.4 Sub track information box
8.14.4.1 Definition
8.14.4.2 Syntax
8.14.4.3 Semantics
147 8.14.5 Sub track definition box
8.14.5.1 Definition
148 8.14.5.2 Syntax
8.14.6 Sub track sample group box
8.14.6.1 Definition
8.14.6.2 Syntax
8.14.6.3 Semantics
8.15 Restricted media tracks
8.15.1 General
149 8.15.2 Restricted sample entry transformation
150 8.15.3 Restricted scheme information box
8.15.3.1 Definition
8.15.3.2 Syntax
8.15.4 Scheme for stereoscopic video arrangements
8.15.4.1 General
151 8.15.4.2 Stereo video box
8.15.4.2.1 Definition
8.15.4.2.2 Syntax
8.15.4.2.3 Semantics
153 8.15.5 Compatible scheme type box
8.15.5.1 Definition
8.15.5.2 Syntax
8.15.5.3 Semantics
8.15.6 Sample-packed tracks
8.15.6.1 Introduction
154 8.15.6.2 Sample packing information box
8.15.6.2.1 Definition
8.15.6.2.2 Syntax
8.15.6.2.3 Semantics
8.16 Segments
8.16.1 Overview
8.16.2 Segment type box
8.16.2.1 Definition
155 8.16.2.2 Syntax
8.16.3 Segment index box
8.16.3.1 Definition
157 8.16.3.2 Syntax
8.16.3.3 Semantics
159 8.16.4 Subsegment index box
8.16.4.1 Definition
160 8.16.4.2 Syntax
161 8.16.4.3 Semantics
8.16.5 Producer reference time box
8.16.5.1 Definition
162 8.16.5.2 Syntax
8.16.5.3 Semantics
8.17 Support for incomplete tracks
8.17.1 General
163 8.17.2 Transformation
164 8.17.3 Complete track information box
8.17.3.1 Definition
8.17.3.2 Syntax
8.18 Entity grouping
8.18.1 General
8.18.2 Groups list box
8.18.2.1 Definition
165 8.18.2.2 Syntax
8.18.3 Entity to group box
8.18.3.1 Definition
8.18.3.2 Syntax
8.18.3.3 Semantics
166 8.19 Compressed boxes
8.19.1 Overview and processing
8.19.2 Processing model
8.19.2.1 File parsing processing model
167 8.19.2.2 File decompression processing model
168 8.19.3 General syntax
8.19.4 General semantics
8.19.5 Original file-type box
8.19.5.1 Definition
169 8.19.5.2 Syntax
8.19.6 Compressed movie box
8.19.6.1 Definition
8.19.6.2 Syntax
8.19.7 Compressed movie fragment box
8.19.7.1 Definition
8.19.7.2 Syntax
170 8.19.8 Compressed segment index box
8.19.8.1 Definition
8.19.8.2 Syntax
8.19.9 Compressed subsegment index box
8.19.9.1 Definition
171 8.19.9.2 Syntax
9 Hint track formats
9.1 RTP and SRTP hint track format
9.1.1 Overview
9.1.2 Sample entry format
9.1.2.1 Structure
172 9.1.2.2 SRTP process box
173 9.1.3 Sample format
9.1.3.1 Sample format definition
174 9.1.3.2 Packet entry format
175 9.1.3.3 Constructor format
There are various forms of the constructor. Each constructor is 16 bytes, to make iteration easier. The first byte is a union discriminator:
aligned(8) class RTPconstructor(type) { unsigned int(8) constructor_type = type; } aligned(8) class RTPnoopconstructor extends RTPconstructor(0) { uint(8) pad[15]; } aligned(8) class RTPimmediateconstructor extends RTPconstructor(1) { unsigned …
176 9.1.4 SDP information
9.1.4.1 Overview
9.1.4.2 Movie SDP information
177 9.1.4.3 Track SDP information
9.1.5 Statistical information
178 9.2 ALC/LCT and FLUTE hint track format
9.2.1 Overview
179 9.2.2 Design principles
180 9.2.3 Sample entry format
9.2.3.1 Definition
9.2.3.2 Syntax
181 9.2.3.3 Semantics
9.2.4 Sample format
9.2.4.1 Sample container
9.2.4.2 Packet entry format
182 9.2.4.3 LCT header template format
9.2.4.4 LCT header extension constructor format
183 9.2.4.5 Packet constructor format
184 9.2.4.6 Extra data box
9.2.4.7 FEC information box
9.2.4.7.1 Definition
9.2.4.7.2 Syntax
9.2.4.7.3 Semantics
185 9.3 MPEG-2 transport hint track format
9.3.1 Overview
9.3.2 Design principles
9.3.2.1 General principles
9.3.2.2 Reusing existing transport streams
186 9.3.2.3 Timing
9.3.2.4 Packet grouping
9.3.2.5 Random-access points
187 9.3.2.6 Application as a reception hint track
9.3.3 Sample entry format
9.3.3.1 Definition
9.3.3.2 Syntax
188 9.3.3.3 Semantics
189 9.3.4 Sample format
9.3.4.1 Definition
190 9.3.4.2 Syntax
191 9.3.4.3 Semantics
192 9.3.5 Protected MPEG 2 transport stream hint track
9.3.5.1 Overview
9.3.5.2 Syntax
9.3.5.3 Semantics
9.4 RTP, RTCP, SRTP and SRTCP reception hint tracks
9.4.1 RTP reception hint track
9.4.1.1 Overview
193 9.4.1.2 Sample entry format
194 9.4.1.3 Sample format
195 9.4.1.4 Packet entry format
196 9.4.1.5 SDP information
9.4.2 RTCP reception hint track
9.4.2.1 Overview
9.4.2.2 General
197 9.4.2.3 Sample entry format
9.4.2.4 Sample format
9.4.2.4.1 Overview
9.4.2.4.2 Syntax
9.4.2.4.3 Semantics
9.4.3 SRTP reception hint track
9.4.3.1 Overview
198 9.4.3.2 Sample entry format
9.4.3.2.1 Sample entry
9.4.3.2.2 Received cryptographic context ID box
199 9.4.3.2.3 Rollover counter box
9.4.3.3 Sample and packet entry format
9.4.4 SRTCP reception hint tracks
9.4.4.1 Overview
200 9.4.4.2 General
9.4.4.3 Sample entry format
9.4.4.4 Sample format
9.4.5 Protected RTP reception hint track
9.4.5.1 Overview
9.4.5.2 Syntax
201 9.4.5.3 Semantics
9.4.6 Recording procedure
9.4.7 Parsing procedure
10 Sample groups
10.1 Random access recovery points
10.1.1 Definition
202 10.1.2 Syntax
10.1.3 Semantics
10.2 Rate share groups
10.2.1 Overview
204 10.2.2 Rate share sample group entry
10.2.2.1 Definition
10.2.2.2 Syntax
10.2.2.3 Semantics
205 10.2.3 Relationship between tracks
10.2.4 Bitrate allocation
206 10.3 Alternative startup sequences
10.3.1 Definition
10.3.2 Syntax
10.3.3 Semantics
207 10.3.4 Examples
209 10.4 Random access point (RAP) sample group
10.4.1 Definition
10.4.2 Syntax
10.4.3 Semantics
10.5 Temporal level sample group
10.5.1 Definition
210 10.5.2 Syntax
10.5.3 Semantics
10.6 Stream access point sample group
10.6.1 Definition
211 10.6.2 Syntax
10.6.3 Semantics
10.7 Sample-to-item sample group
10.7.1 Definition
10.7.2 Syntax
10.7.3 Semantics
212 10.8 Dependent random access point (DRAP) sample group
10.8.1 Definition
10.8.2 Syntax
10.8.3 Semantics
213 10.9 Pixel Aspect Ratio Sample Grouping
10.9.1 Definition
10.9.2 Syntax
10.9.3 Semantics
10.10 Clean Aperture Sample Grouping
10.10.1 Definition
10.10.2 Syntax
214 10.10.3 Semantics
10.11 Extended DRAP (EDRAP) sample group
10.11.1 Definition
10.11.2 Syntax
10.11.3 Semantics
215 10.12 Essential descriptions hierarchy sample grouping
10.12.1 Definition
216 10.12.2 Syntax
10.12.3 Semantics
11 Derived file formats
217 12 Media-specific definitions
12.1 Video media
12.1.1 Media handler
12.1.2 Video media header
12.1.2.1 Definition
218 12.1.2.2 Syntax
12.1.2.3 Semantics
12.1.3 Sample entry
12.1.3.1 Definition
12.1.3.2 Syntax
12.1.3.3 Semantics
219 12.1.4 Pixel aspect ratio and clean aperture
12.1.4.1 Definition
220 12.1.4.2 Syntax
12.1.4.3 Semantics
12.1.5 Colour information
12.1.5.1 Definition
221 12.1.5.2 Syntax
12.1.5.3 Semantics
12.1.6 Content light level
12.1.6.1 Definition
222 12.1.6.2 Syntax
12.1.7 Mastering display colour volume
12.1.7.1 Definition
12.1.7.2 Syntax
12.1.8 Content colour volume
12.1.8.1 Definition
223 12.1.8.2 Syntax
12.1.9 Ambient viewing environment
12.1.9.1 Definition
12.1.9.2 Syntax
12.2 Audio media
12.2.1 Media handler
224 12.2.2 Sound media header
12.2.2.1 Definition
12.2.2.2 Syntax
12.2.2.3 Semantics
12.2.3 Sample entry
12.2.3.1 Definition
225 12.2.3.2 Syntax
226 12.2.3.3 Semantics
227 12.2.4 Channel layout
12.2.4.1 Definition
228 12.2.4.2 Syntax
230 12.2.4.3 Semantics
231 12.2.5 Downmix instructions
12.2.5.1 Definition
232 12.2.5.2 Syntax
12.2.5.3 Semantics
235 12.2.6 DRC information
236 12.2.7 Audio stream loudness
12.2.7.1 Definition
237 12.2.7.2 Syntax
238 12.2.7.3 Semantics
12.2.8 Audio rendering indication box
12.2.8.1 Definition
239 12.2.8.2 Syntax
12.2.8.3 Semantics
12.2.9 Audio element box
12.2.9.1 Definition
12.2.9.2 Syntax
240 12.2.10 Audio element description box
12.2.10.1 Definition
12.2.10.2 Syntax
12.2.10.3 Syntax
12.2.11 Audio element positioning interactivity polar box
12.2.11.1 Definition
241 12.2.11.2 Syntax
12.2.11.3 Semantics
12.2.12 Audio element prominence interactivity box
242 12.2.12.1 Definition
12.2.12.2 Syntax
12.2.12.3 Semantics
12.2.13 Audio element selection box
12.2.13.1 Definition
12.2.13.2 Syntax
12.2.14 Audio element selection description box
12.2.14.1 Definition
243 12.2.14.2 Syntax
12.2.14.3 Semantics
12.3 Metadata media
12.3.1 Media handler
12.3.2 Media header
12.3.3 Sample entry
12.3.3.1 Definition
244 12.3.3.2 Syntax
12.3.3.3 Semantics
245 12.4 Hint media
12.4.1 Overview
246 12.4.2 Media handler
12.4.3 Hint media header
12.4.3.1 Hint media header box
247 12.4.3.2 Syntax
12.4.3.3 Semantics
12.4.4 Sample entry
12.4.4.1 Definition
12.4.4.2 Syntax
12.5 Text media
12.5.1 Media handler
12.5.2 Media header
12.5.3 Sample entry
12.5.3.1 Definition
248 12.5.3.2 Syntax
12.5.3.3 Semantics
12.6 Subtitle media
12.6.1 Media handler
12.6.2 Subtitle media header
12.6.2.1 Definition
12.6.2.2 Syntax
12.6.2.3 Semantics
12.6.3 Sample entry
12.6.3.1 Definition
249 12.6.3.2 Syntax
12.6.3.3 Semantics
12.7 Font media
12.7.1 Media handler
12.7.2 Media header
250 12.7.3 Sample entry
12.7.3.1 Definition
12.7.3.2 Syntax
12.8 Transformed media
12.8.1 General
12.8.2 Multiple transformations for a single transformed media track
12.8.3 Determining the untransformed sample entry type
251 12.8.4 The ‘codecs’ MIME parameter for a transformed media track
12.9 Multiplexed timed metadata tracks
12.9.1 General
252 12.9.2 Overall design
12.9.3 Sample format
12.9.4 Sample entry format
12.9.4.1 General
253 12.9.4.2 Metadata key table box
12.9.4.3 Metadata key box
12.9.4.3.1 Definition
254 12.9.4.3.2 Syntax
12.9.4.4 Metadata key declaration box
12.9.4.4.1 Definition
12.9.4.4.2 Syntax
12.9.4.4.3 Semantics
255 12.9.4.5 Metadata locale box
12.9.4.5.1 Definition
12.9.4.5.2 Syntax
12.9.4.5.3 Semantics
12.9.4.6 Metadata setup box
12.9.4.6.1 Definition
12.9.4.6.2 Syntax
256 12.9.5 Defined formats
12.9.5.1 Null
12.9.5.2 User-data
12.9.5.3 Un-multiplexed timed metadata
12.9.5.4 Other forms
257 12.10 Volumetric visual media
12.10.1 Media handler
12.10.2 Media header
12.10.2.1 Definition
12.10.2.2 Syntax
12.10.3 Sample entry
12.10.3.1 Definition
12.10.3.2 Syntax
12.10.3.3 Semantics
12.10.4 Sample format
258 12.11 Haptic media
12.11.1 Media handler
12.11.2 Media header
12.11.3 Sample entry
12.11.3.1 Definition
12.11.3.2 Syntax
12.11.4 Sample format
259 Annex A (informative) Background and tutorial
A.1 Annex overview
A.2 Design considerations
A.2.1 Usage
A.2.1.1 Multi-purpose
A.2.1.2 Interchange
A.2.1.3 Content creation
260 A.3 Design principles
261 A.4 Core concepts
A.5 Physical structure of the media
262 A.6 Temporal structure of the media
A.7 Interleave
A.8 Composition
263 A.9 Random access
A.10 Fragmented movie files
264 A.11 Construction of fragmented movies
265 A.12 Transformed streaming over streaming protocols
A.12.1 Design considerations for streaming protocols
A.12.1.1.1 Preparation for transformed streaming
A.12.1.1.2 Local presentation
266 A.12.1.1.3 Streamed presentation
A.12.2 Design considerations for streaming protocols
A.12.2.1 General
267 A.12.2.2 Protocol ‘hint’ tracks
268 Annex B (informative) Guidance on deriving from this document
B.1 General
B.2 General principles
B.2.1 General
B.2.2 Base layer operations
269 B.3 Boxes
270 B.4 Brand identifiers
B.4.1 Overview
B.4.2 Usage of the brand
B.4.3 Introduction of a new brand
271 B.4.4 Player guideline
B.4.5 Authoring guideline
B.4.6 Example
272 B.5 Storage of new media types
273 B.6 Tracks
B.6.1 Data location
274 B.6.2 Time
B.6.3 Media types
B.6.4 Coding types
B.6.5 Sub-sample information
275 B.6.6 Sample dependency
B.6.7 Sample groups
B.6.8 Track-level
B.6.9 Protection
276 B.7 Metadata
B.8 Sample groups, timed metadata tracks, and sample auxiliary information
278 Annex C (normative) Fragment identifiers for ISO base media resources
C.1 Overview
C.2 Syntax and semantics
280 Annex D (informative) Management of extension code-points
D.1 Code points
281 D.2 Procedure for the request of an extension code-point
282 Annex E (normative) File format brands
E.1 General
283 E.2 The ‘isom’ brand
284 E.3 The ‘avc1’ brand
E.4 The ‘iso2’ brand
285 E.5 The ‘mp71’ brand
286 E.6 The ‘iso3’ brand
E.7 The ‘iso4’ brand
287 E.8 The ‘iso5’ brand
E.9 The ‘iso6’ brand
288 E.10 The ‘iso7’ brand
E.11 The ‘iso8’ brand
289 E.12 The ‘iso9’ brand
E.13 The ‘isoa’ brand
E.14 The ‘isob’ brand
290 E.15 The ‘relo’ brand
E.15.1 Requirements for files
E.15.2 Requirements for readers
E.15.3 Use cases (informative)
E.15.3.1 File editing
291 E.15.3.2 Partial image file reception
292 E.16 The ‘isoc’ brand
E.17 The ‘comp’ brand
293 E.18 The ‘unif’ brand
E.18.1 General
E.18.2 Requirements for files
294 E.18.3 Requirements for readers
295 Annex F (normative) MIME type registration of segments
F.1 Overview
F.2 Registration
296 Annex G (informative) URI-labelled metadata forms
G.1 UUID-labelled metadata
G.2 ISO OID-labelled metadata
G.3 SMPTE-labelled metadata
298 Annex H (informative) Processing of RTP streams and reception hint tracks
H.1 General
H.1.1 Overview
H.1.2 Structure
H.1.3 Specific definitions
H.2 Synchronization of RTP streams
299 H.3 Recording of RTP streams
H.3.1 Overview
302 H.3.2 Compensation for unequal starting for position of received RTP streams
303 H.3.3 Recording of SDP
H.3.4 Creation of a sample within an RTP reception hint track
304 H.3.5 Representation of RTP timestamps
307 H.3.6 Recording operations to facilitate inter-stream synchronization in playback
H.3.6.1 General
308 H.3.6.2 Facilitating lip synchronization based on RTCP sender reports
H.3.6.3 Compensating clock drift in timestamps
309 H.3.7 Representation of reception times
H.3.8 Creation of media samples
310 H.3.9 Creation of hint samples referring to media samples
H.4 Playing of recorded RTP streams
H.4.1 Overview
H.4.2 Preparation for the playback
311 H.4.3 Decoding of a sample within an RTP reception hint track
H.4.4 Lip synchronization
313 H.4.5 Random access
H.5 Re-sending recorded RTP streams
H.5.1 Overview
314 H.5.2 Re-sending RTP packets
315 H.5.3 RTCP processing
316 Annex I (normative) Stream access points
I.1 General
I.2 SAP properties
I.2.1 General
317 I.2.2 SAP properties for layers
318 I.3 SAP types
319 I.4 SAP definitions for the ISO base media file format
320 Annex J (informative) Segment index examples
J.1 General
J.2 Examples
J.2.1 Simple one-level indexing
J.2.2 Hierarchical
321 J.2.3 Daisy-chain
322 J.2.4 Combination hierarchical and daisy-chain
324 Annex K (normative) Use of IETF RFC 6381 for ISOBMFF files
K.1 General
K.2 Use of the ‘codecs’ parameter
K.2.1 General rules
325 K.2.2 Syntax
K.3 The ‘codecs’ parameter for transformed media tracks
326 K.4 Use of the ‘profiles’ parameter
327 K.5 Use of the ‘itemtypes’ parameter
K.6 Use of the ‘essential’ parameter
BSI 22/30464047 DC 2022
$45.21