Formex Version 4

Composition of filenames

General rules related to OJ instances

[ 1]

filename

::=

file.OJ | file.ECR | file.Schema | file.PDF | file.CL | file.Sec.Leg

[ 2]

file.OJ

::=

series subseries year ojno language '.' (volume page sequence '.')? (container '.')? type

[ 3]

series

::=

'A' | 'C' | 'L' | 'P' | 'S' | 'X'

[ 4]

subseries

::=

'_' | 'A' | 'E' | 'I' | 'M'

[ 5]

year

::=

d{4}

[ 6]

ojno

::=

d{3}

[ 7]

language

::=

'BG' | 'CS' | 'DA' | 'DE' | 'EL' | 'EN' | 'ES' | 'ET' | 'FI' | 'FR' | 'GA' | 'HR' | 'HU' | 'IS' | 'IT' | 'LT' | 'LV' | 'MT' | 'NL' | 'PL' | 'PT' | 'RO' | 'SK' | 'SL' | 'SV' | 'TR'

[ 8]

volume

::=

d{2}

[ 9]

page

::=

d{4} | ('C'| 'E' | 'S') d{3}

[10]

sequence

::=

d{2}

[11]

container

::=

'doc' | 'frag' | 'toc'

[12]

type

::=

'frg' | 'xml' | 'tif' | 'tgz' | 'zip'

[13]

file.Sec.Leg

::=

'DD' year '_' chapter tome language '.' (page sequence '.')? (container '.')? type

[14]

chapter

::=

d{2}

[15]

tome

::=

d{3}

[16]

d

::=

'0' | '1' | '2' | '3' | '4' | '5' | '6' | '7' | '8' | '9'

Vocabulary

chapter

chapter within the context of secondary legislation publications

container

indicator for instance describing the composition of a document or a publication:

no container: XML instance, document component

doc: description of a document

frag: encapsulation of fragments

toc: table of contents of a volume (not a table of contents within a document!)

d

symbolic placeholder for digits

file.OJ

file within a OJ publication

file.ECR

file within European Court Reports

file.Schema

file containing a Formex schema version

file.PDF

file within a publication in PDF

file.CL

file containing a consolidated act

file.Sec.Leg

table of contents or specific file within a secondary legislation publication

language

language of the document or the publication

ojno

number of the Official Journal

page

starting page of the document; in some cases the number is preceded by a letter, which indicates that the numbering does not follow the normal pagination; the letter 'S' indicates supplement pages which are used for the cover pages of a publication of for inserted forms in the case of competitions, 'E' indicates any other inserted object, 'C' precedes the number of cover pages of a consolidated document; in any case the length of the pagination never exceeds 4 characters

sequence

sequence of document on the starting page

series

series of the Official Journal

subseries

indication of an Official Journal annexed to another one (A), only existing in an electronic format (E), being an isolated publication (I) or a special edition in Maltese (M)

tome

tome within a chapter of the secondary legislation

type

indication of the contents format:

frg: The file contains fragments of a bigger document; the content is supposed to be coded in XML (Formex), but it must not necessarily be valid nor well-formed.

xml: The file contains a valid XML (Formex) instance. This is why a container of type 'frag' should always have this file extension.

tif: the file contains an image in TIFF format.

volume

indication of the volume of a publication, in most cases this value is 01

year

year of the publication of the Official Journal

Examples:

L_2002309EN.01000101.xml

English version of instance of the document starting on page 1 of volume 1, sequence position 1, in Official Journal series L, no subseries, number 309 of the year 2002

L_2002309EN.01000101.doc.xml

English version of the description of the composition of the document starting on page 1 of volume 1, sequence position 1, in Official Journal series L, no subseries, number 309 of the year 2002

L_2002309EN.toc.xml

description of the publication of an Official Journal, English version, number 309 of 2002, series L

L_2002309EN.xml

table of contents accompanying the PDF package (SUMMARY.PDF instance)

L_2002309EN.xml.tgz

compressed archive of the delivery of the Formex version of an OJ

General rules for ECR instances

These definitions may only be used in the context of the production of European Court Reports.

[17]

file.ECR

::=

"ECR" court year lg part subpart? "." fascicle ((start.page page.sequence)? (".case" | ".notice")?)? (".xml" | ".tif")

[18]

court

::=

"CJ" | "T1" | "F1" | "F2"

[19]

year

::=

d{4}

[20]

lg

::=

"BG" | "CS" | "DA" | "DE" | "EL" | "EN" | "ES" | "ET" | "FI" | "FR" | "GA" | 'HR' | "HU" | 'IS' | "IT" | "LT" | "LV" | "MT" | "NL" | "PL" | "PT" | "RO" | "SK" | "SL" | "SV" | "XM"

[21]

part

::=

"A"|B"|"C"

[22]

subpart (only for CST reports)

::=

"1" | "2"

[23]

fascicle

::=

d{2}

[24]

start.page

::=

([aprs]d{5})|d{6}

[25]

page.sequence

::=

d{2}

Examples

ECRCJ2005FRA.01.xml

description of a fascicle

ECRCJ2005FRA.0100043901.case.xml

description of a case

ECRCJ2005FRA.0100044101.xml

contents

The schema file

The file containing the XML schema has a special filename. It is composed as follows:

[27]

file.Schema

::=

'formex-' release '-' date '.xd'

[28]

release

::=

d{2} "." d{2}|"cj"

[29]

date

::=

d{8}

[30]

d

::=

'0' | '1' | '2' | '3' | '4' | '5' | '6' | '7' | '8' | '9'

The schema file which currently is in force can be found under the following address:

Example

formex-01.00-20040501.xd

Schema file release 01.00, entering into force May 1, 2004

The PDF files

All the PDF filenames should follow the definition below:

[31]

file.PDF

::=

series subseries OJno date language (startpage endpage)? quality? '.pdf'

[32]

date

::=

year month day

[33]

month

::=

d{2}

[34]

day

::=

d{2}

[35]

startpage

::=

(d{4}) | ([cser]d{3}

[36]

endpage

::=

(d{4}) | ([cser]d{3}

[36]

quality

::=

'.x'

PDF filenames must be in lower-case.

The table of contents should have as start and end page the number '0000'. The archive should contain also the table of contents of the delivered Official Journal in XML format (should be named following the example below).

Vocabulary

date

The date of the publication of the OJ.

endpage

Number of the last page in the document.

startpage

Number of the first page in the document.

quality

Indication of the PDF quality: 'x' refers to PDF/X, absence of quality (default value) refers to PDF/A.

Examples

l_30920020503en00000000.pdf

PDF document (table of contents)

l_30920020503en00010013.pdf

PDF document (pages 1-13)

l_30920020503en00140029.pdf

PDF document (pages 14-29)

l_30920020503en.xml

XML document (Table of Contents)

General rules for Consolidation

Formex files delivered in the context of consolidation (contents: clfilename, archive: clarchive) have to be named as follows:

[39]

file.CL

::=

series year celexcode lg prodseq "." incseq "." (container ".")? type

[40]

clarchive

::=

series year celexcode lg prodseq "." contenttype "." intermediate? archivetype

[41]

series

::=

"CL"

[42]

year

::=

d{4}

[43]

celexcode

::=

c{4}

[44]

lg

::=

"BG" | "CS" | "DA" | "DE" | "EL" | "EN" | "ES" | "ET" | "FI" | "FR" | "GA" | 'HR' | "HU" | 'IS' | "IT" | "LT" | "LV" | "MT" | "NL" | "PL" | "PT" | "RO" | "SK" | "SL" | "SV" | "XM"

[45]

prodseq

::=

d{7}

[46]

incseq

::=

d{4}

[47]

container

::=

"doc" | "sls"

[48]

type

::=

"xml" | "tif" | "pdf"

[49]

contenttype

::=

"xml" | "pdf"

[50]

intermediate

::=

"int."

[51]

archivetype

::=

"zip" | "tgz"

[49]

c

::=

'A' | 'B' | 'C' | 'D' | 'E' | 'F' |'G' | 'H' | 'I' | 'J' | 'K' |

'L' | 'M' | 'N' | 'O' | 'P' | 'Q' | 'R' | 'S' | 'T' | 'U' | 'V' |

'W' |'X' | 'Y' | 'Z'

[50]

d

::=

'0' | '1' | '2' | '3' | '4' | '5' | '6' | '7' | '8' | '9'

Vocabulary

archivetype

type of compression method

c

symbolic placeholder for a character

celexcode

CELEX code (5 positions, capital letter followed by 4 digits)

container

flag for contents of instance: document description or consolidation description

contenttype

type of format in archive, XML or PDF

d

symbolic placeholder for digits

incseq

incremental sequence number starting with 0001, incremented by 1

intermediate

flag for intermediate, not definitive delivery

lg

symbolic placeholder for the language of the document or archive

prodseq

production sequence number as defined by the attribute PROD.SEQ of the INFO.CONSLEG element

series

fixed value: CL

type

file format

year

year of the basic document of the consolidated family

Examples

CL1953D0030FR0040010.xml.zip

archive with XML content

CL1953D0030FR0040010.xml.int.zip

intermediate delivery of an archive with XML content

CL1953D0030FR0040010.0001.doc.xml

document description

CL1953D0030FR0040010.0001.xml

consolidated XML document

CL1953D0030FR0040010.0001.sls.xml

consolidation history description

General rules for secondary legislation

The naming of the documents which are translations of documents published in the OJ will follow the existing rules. For the documents that are specific to the special edition, the following convention should be adopted:

[51]

file.Sec.Leg

::=

'DD' year '_' chapter.se volume.se language '.' (volume page sequence '.')? (container '.')? type

[52]

year

::=

d{4}

[53]

chapter.se

::=

d{2}

[54]

volume.se

::=

d{3}

[55]

language

::=

'BG' | 'CS' | 'DA' | 'DE' | 'EL' | 'EN' | 'ES' | 'ET' | 'FI' | 'FR' | 'GA' | 'HR' | 'HU' | 'IS' | 'IT' | 'LT' | 'LV' | 'MK' | 'MT' | 'NL' | 'PL' | 'PT' | 'RO'| 'SK' | 'SL' | 'SV' | 'TR'

[56]

page

::=

d{4} | ('C'|'E'|'S') d{3}

[57]

sequence

::=

d{2}

[58]

container

::=

'doc' | 'frag' | 'toc'

[59]

type

::=

'frg' | 'xml' | 'tif' | 'tgz' | 'zip'

[60]

d

::=

'0' | '1' | '2' | '3' | '4' | '5' | '6' | '7' | '8' | '9'

Vocabulary

chapter.se

chapter number of the special edition

container

indicator for instance describing the composition of a document or a publication:

no container: XML instance, document component

doc: description of a document

frag: encapsulation of fragments

toc: table of contents of a volume (not a table of contents within a document!)

d

symbolic placeholder for digits

language

language of the document or the publication (a list of valid values is available within the schema)

page

starting page of the document; in some cases the number is preceded by a letter, which indicates that the numbering does not follow the normal pagination; the letter 'S' indicates supplement pages which are used for the cover pages of a publication or for inserted forms in the case of competitions, 'E' indicates any other inserted object, 'C' precedes the number of the cover pages of a consolidated document; in any case the length of the pagination never exceeds 4 characters

sequence

sequence of document on the starting page

type

indication of the contents format:

frg: The file contains fragments of a bigger document; the content is supposed to be coded in XML (Formex), but it must not necessarily be valid nor well-formed

xml: The file contains a valid XML (Formex) instance. This is why a container of type 'frag' should always have this file extension

xsl: The file contains a layout description in form of a stylesheet in XSL

tif: the file contains an image in TIFF format.

volume.se

volume number of the special edition

year

year of the publication of the Official Journal

Examples:

DD2004_02001CS.000101.xml

Czech version of instance of the first document starting on page 1, in Special Edition chapter 2 volume 1 of the year 2004

DD2004_02001CS.000101.doc.xml

Czech version of the description of the composition of the first document starting on page 1, in Special Edition chapter 2 volume 1 of the year 2004

L_1975100CS.01000101.xml

Czech version of instance of the document that corresponds to the translation of one published in OJL, year 1975, number 100, volume 1, page 1, sequence 1

L_1975100CS.01000101.doc.xml

Czech version of the description of the composition of the document that corresponds to the translation of one published in OJL, year 1975, number 100, volume 1, page 1, sequence 1

DD2004_02001CS.toc.xml

description of the publication of a Special Edition, Czech version, chapter 2 volume 1 of the year 2004

DD2004_02001CS.xml

table of contents accompanying the PDF package (SUMMARY.PDF instance)

DD2004_02001CS.xml.tgz

compressed archive of the delivery of the Formex version of a special edition

DD2004_02001CS.xml

instance accompanying the PDF delivery for the Special Edition, Czech version, chapter 2 volume 1 of the year 2004


Formex 4 - Physical Specifications - Filenames
Contact: OP A1.002 "Formats, Linguistic Informatics and Metadata"
Version: 5.59 (20170418)