The only global mechanism for format typing in current widespread use is the MIME(Multipurpose Internet Mail Extensions) Media Types [5] registry operated by IANA (InternetAssigned Numbers Authority) [11]. However, for many digital repository operations MIMEtyping does not provide sufficient granularity to disambiguate important format distinctions,whether based on versioning or profiling. For example, both a tiled RGB TIFF image usingLZW compression and a striped bi-tonal TIFF image with Group 4 compression are typed withthe same MIME identifier: image/tiff. Similarly, the entire PDF family – PDF 1.0 through 1.4,PDF/X-1 through 3 (ISO 15930), and the proposed PDF/A standard [1] – are all typed with asingle identifier: application/pdf. In both cases, the variant digital objects may undergo differentparallel workflows dependent upon the specifics of their internal structure or semantics. Tofacilitate this, the proposed format registry will allow typing, and unambiguous identification, atarbitrary levels of granularity.
This first mechanism is the important one, and will require some care around whether different features are really needed. Includes crucial issues like whether one submission could modify someone elses submission.