Composition of filenames
General rules related to OJ instances
[ 1] |
filename |
::= |
file.OJ | file.ECR | file.Schema | file.PDF | file.CL | file.Sec.Leg |
[ 2] |
file.OJ |
::= |
series subseries year ojno language '.' (volume page sequence '.')? (container '.')? type |
[ 3] |
series |
::= |
'A' | 'C' | 'L' | 'P' | 'S' | 'X' |
[ 4] |
subseries |
::= |
'_' | 'A' | 'E' | 'I' | 'M' |
[ 5] |
year |
::= |
d{4} |
[ 6] |
ojno |
::= |
d{3} |
[ 7] |
language |
::= |
'BG' | 'CS' | 'DA' | 'DE' | 'EL' | 'EN' | 'ES' | 'ET' | 'FI' | 'FR' | 'GA' | 'HR' | 'HU' | 'IS' | 'IT' | 'LT' | 'LV' | 'MT' | 'NL' | 'PL' | 'PT' | 'RO' | 'SK' | 'SL' | 'SV' | 'TR' |
[ 8] |
volume |
::= |
d{2} |
[ 9] |
page |
::= |
d{4} | ('C'| 'E' | 'S') d{3} |
[10] |
sequence |
::= |
d{2} |
[11] |
container |
::= |
'doc' | 'frag' | 'toc' |
[12] |
type |
::= |
'frg' | 'xml' | 'tif' | 'tgz' | 'zip' |
[13] |
file.Sec.Leg |
::= |
'DD' year '_' chapter tome language '.' (page sequence '.')? (container '.')? type |
[14] |
chapter |
::= |
d{2} |
[15] |
tome |
::= |
d{3} |
[16] |
d |
::= |
'0' | '1' | '2' | '3' | '4' | '5' | '6' | '7' | '8' | '9' |
Vocabulary
chapter |
chapter within the context of secondary legislation publications |
container |
indicator for instance describing the composition of a document or a publication: no container: XML instance, document component doc: description of a document frag: encapsulation of fragments toc: table of contents of a volume (not a table of contents within a document!) |
d |
symbolic placeholder for digits |
file.OJ |
file within a OJ publication |
file.ECR |
file within European Court Reports |
file.Schema |
file containing a Formex schema version |
file.PDF |
file within a publication in PDF |
file.CL |
file containing a consolidated act |
file.Sec.Leg |
table of contents or specific file within a secondary legislation publication |
language |
language of the document or the publication |
ojno |
number of the Official Journal |
page |
starting page of the document; in some cases the number is preceded by a letter, which indicates that the numbering does not follow the normal pagination; the letter 'S' indicates supplement pages which are used for the cover pages of a publication of for inserted forms in the case of competitions, 'E' indicates any other inserted object, 'C' precedes the number of cover pages of a consolidated document; in any case the length of the pagination never exceeds 4 characters |
sequence |
sequence of document on the starting page |
series |
series of the Official Journal |
subseries |
indication of an Official Journal annexed to another one (A), only existing in an electronic format (E), being an isolated publication (I) or a special edition in Maltese (M) |
tome |
tome within a chapter of the secondary legislation |
type |
indication of the contents format: frg: The file contains fragments of a bigger document; the content is supposed to be coded in XML (Formex), but it must not necessarily be valid nor well-formed. xml: The file contains a valid XML (Formex) instance. This is why a container of type 'frag' should always have this file extension. tif: the file contains an image in TIFF format. |
volume |
indication of the volume of a publication, in most cases this value is 01 |
year |
year of the publication of the Official Journal |
Examples:
|
General rules related to OJ instances in ABA
[17] |
file.ABA |
::= |
series placeholder pub.year pub.number language '.' (page sequence '.')? (container '.')? format '.' type |
[18] |
series |
::= |
'C' | 'L' |
[19] |
placeholder |
::= |
'_' | c{1} |
[20] |
pub.year |
::= |
d{4} |
[21] |
pub.number |
::= |
d{5} |
[22] |
language |
::= |
'BG' | 'CS' | 'DA' | 'DE' | 'EL' | 'EN' | 'ES' | 'ET' | 'FI' | 'FR' | 'GA' | 'HR' | 'HU' | 'IS' | 'IT' | 'LT' | 'LV' | 'MT' | 'NL' | 'PL' | 'PT' | 'RO' | 'SK' | 'SL' | 'SV' | 'TR' |
[23] |
page |
::= |
d{4} | ('C'| 'E' | 'S') d{3} |
[24] |
sequence |
::= |
d{2} |
[25] |
container |
::= |
'toc' | 'doc' | 'frag' |
[26] |
format |
::= |
'fmx' |
[27] |
type |
::= |
'frg' | 'xml' | 'tif' | 'tgz' | 'zip' |
[28] |
d |
::= |
‘0’ | ‘1’ | ‘2’ | ‘3’ | ‘4’ | ‘5’ | ‘6’ | ‘7’ | ‘8’ | ‘9’ |
[29] |
c |
::= |
'A' | 'B' | 'C' | 'D' | 'E' | 'F' |'G' | 'H' | 'I' | 'J' | 'K' |'L' | 'M' | 'N' | 'O' | 'P' | 'Q' | 'R' | 'S' | 'T' | 'U' | 'V' |'W' |'X' | 'Y' | 'Z' |
Vocabulary
c |
symbolic placeholder for a capital letter |
container |
indicator for instance describing the composition of a document or a publication:
|
d |
symbolic placeholder for digits |
file.ABA |
file published following ABA production |
format |
the format used for the xml instances - “fmx” (FORMEX); to be distinguished from other future formats |
language |
language of the document or the publication |
page |
starting page of the document; in some cases the number is preceded by a letter, which indicates that the numbering does not follow the normal pagination; the letter 'S' indicates supplement pages which are used for inserted forms in the case of competitions, 'E' indicates any other inserted object, 'C' precedes the number of cover pages of a consolidated document; in any case the length of the pagination never exceeds 4 characters |
placeholder |
a position for a character generally represented by an “_” or a capital letter |
pub.number |
the number of the publication:
|
pub.year |
the year of the publication |
sequence |
sequence of document on the starting page |
series |
series L and C of the Official Journal |
type |
indication of the contents format:
|
Examples:
|
General rules for ECR instances
These definitions may only be used in the context of the production of European Court Reports.
[30] |
file.ECR |
::= |
"ECR" court year lg part subpart? "." fascicle ((start.page page.sequence)? (".case" | ".notice")?)? (".xml" | ".tif") |
[31] |
court |
::= |
"CJ" | "T1" | "F1" | "F2" |
[32] |
year |
::= |
d{4} |
[33] |
lg |
::= |
"BG" | "CS" | "DA" | "DE" | "EL" | "EN" | "ES" | "ET" | "FI" | "FR" | "GA" | 'HR' | "HU" | 'IS' | "IT" | "LT" | "LV" | "MT" | "NL" | "PL" | "PT" | "RO" | "SK" | "SL" | "SV" | "XM" |
[34] |
part |
::= |
"A"|B"|"C" |
[35] |
subpart (only for CST reports) |
::= |
"1" | "2" |
[36] |
fascicle |
::= |
d{2} |
[37] |
start.page |
::= |
([aprs]d{5})|d{6} |
[38] |
page.sequence |
::= |
d{2} |
Examples
|
The schema file
The file containing the XML schema has a special filename. It is composed as follows:
[39] |
file.Schema |
::= |
'formex-' release '-' date '.xd' |
[40] |
release |
::= |
d{2} "." d{2}|"cj" |
[42] |
date |
::= |
d{8} |
[42] |
d |
::= |
'0' | '1' | '2' | '3' | '4' | '5' | '6' | '7' | '8' | '9' |
The schema file which currently is in force can be found under the following address:
- http://formex.publications.europa.eu/
Example
|
The PDF files
All the PDF filenames should follow the definition below:
[43] |
file.PDF |
::= |
series subseries OJno date language (startpage endpage)? quality? '.pdf' |
[44] |
date |
::= |
year month day |
[45] |
month |
::= |
d{2} |
[46] |
day |
::= |
d{2} |
[47] |
startpage |
::= |
(d{4}) | ([cser]d{3} |
[48] |
endpage |
::= |
(d{4}) | ([cser]d{3} |
[49] |
quality |
::= |
'.x' |
PDF filenames must be in lower-case.
The table of contents should have as start and end page the number '0000'. The archive should contain also the table of contents of the delivered Official Journal in XML format (should be named following the example below).
Vocabulary
date |
The date of the publication of the OJ. |
endpage |
Number of the last page in the document. |
startpage |
Number of the first page in the document. |
quality |
Indication of the PDF quality: 'x' refers to PDF/X, absence of quality (default value) refers to PDF/A. |
Examples
|
The PDF files in ABA
All the PDF filenames in ABA production should follow the definition below:
[50] |
file.ABA.PDF |
::= |
series placeholder pub.year pub.number language quality? '.pdf' |
[51] |
series |
::= |
'C' | 'L' |
[52] |
placeholder |
::= |
'_' | c{1} |
[53] |
pub.year |
::= |
d{4} |
[54] |
pub.number |
::= |
d{5} |
[55] |
language |
::= |
'BG' | 'CS' | 'DA' | 'DE' | 'EL' | 'EN' | 'ES' | 'ET' | 'FI' | 'FR' | 'GA' | 'HR' | 'HU' | 'IS' | 'IT' | 'LT' | 'LV' | 'MT' | 'NL' | 'PL' | 'PT' | 'RO' | 'SK' | 'SL' | 'SV' | 'TR' |
[56] |
quality |
::= |
'.x' |
[57] |
d |
::= |
'0' | '1' | '2' | '3' | '4' | '5' | '6' | '7' | '8' | '9' |
[58] |
c |
::= |
'A' | 'B' | 'C' | 'D' | 'E' | 'F' |'G' | 'H' | 'I' | 'J' | 'K' |'L' | 'M' | 'N' | 'O' | 'P' | 'Q' | 'R' | 'S' | 'T' | 'U' | 'V' |'W' |'X' | 'Y' | 'Z' |
Vocabulary
c |
symbolic placeholder for a capital letter. |
d |
symbolic placeholder for digits. |
file.ABA.PDF |
file within a publication in PDF. |
language |
language of the document or the publication. |
placeholder |
a position for a character generally represented by an “_” or a capital letter. |
pub.number |
the number of the publication:
|
pub.year |
the year of the publication. |
series |
series L and C of the Official Journal. |
quality |
Indication of the PDF quality: 'x' refers to PDF/X, absence of quality (default value) refers to PDF/A. |
Examples
|
General rules for Consolidation
Formex files delivered in the context of consolidation (contents: clfilename, archive: clarchive) have to be named as follows:
[59] |
file.CL |
::= |
series year celexcode lg prodseq "." incseq "." (container ".")? type |
[60] |
clarchive |
::= |
series year celexcode lg prodseq "." contenttype "." intermediate? archivetype |
[61] |
series |
::= |
"CL" |
[62] |
year |
::= |
d{4} |
[63] |
celexcode |
::= |
c{4} |
[64] |
lg |
::= |
"BG" | "CS" | "DA" | "DE" | "EL" | "EN" | "ES" | "ET" | "FI" | "FR" | "GA" | 'HR' | "HU" | 'IS' | "IT" | "LT" | "LV" | "MT" | "NL" | "PL" | "PT" | "RO" | "SK" | "SL" | "SV" | "XM" |
[65] |
prodseq |
::= |
d{7} |
[66] |
incseq |
::= |
d{4} |
[67] |
container |
::= |
"doc" | "sls" |
[68] |
type |
::= |
"xml" | "tif" | "pdf" |
[69] |
contenttype |
::= |
"xml" | "pdf" |
[70] |
intermediate |
::= |
"int." |
[71] |
archivetype |
::= |
"zip" | "tgz" |
[72] |
c |
::= |
'A' | 'B' | 'C' | 'D' | 'E' | 'F' |'G' | 'H' | 'I' | 'J' | 'K' | 'L' | 'M' | 'N' | 'O' | 'P' | 'Q' | 'R' | 'S' | 'T' | 'U' | 'V' | 'W' |'X' | 'Y' | 'Z' |
[73] |
d |
::= |
'0' | '1' | '2' | '3' | '4' | '5' | '6' | '7' | '8' | '9' |
Vocabulary
archivetype |
type of compression method |
c |
symbolic placeholder for a character |
celexcode |
CELEX code (5 positions, capital letter followed by 4 digits) |
container |
flag for contents of instance: document description or consolidation description |
contenttype |
type of format in archive, XML or PDF |
d |
symbolic placeholder for digits |
incseq |
incremental sequence number starting with 0001, incremented by 1 |
intermediate |
flag for intermediate, not definitive delivery |
lg |
symbolic placeholder for the language of the document or archive |
prodseq |
production sequence number as defined by the attribute PROD.SEQ of the INFO.CONSLEG element |
series |
fixed value: CL |
type |
file format |
year |
year of the basic document of the consolidated family |
Examples
|
General rules for secondary legislation
The naming of the documents which are translations of documents published in the OJ will follow the existing rules. For the documents that are specific to the special edition, the following convention should be adopted:
[74] |
file.Sec.Leg |
::= |
'DD' year '_' chapter.se volume.se language '.' (volume page sequence '.')? (container '.')? type |
[75] |
year |
::= |
d{4} |
[76] |
chapter.se |
::= |
d{2} |
[77] |
volume.se |
::= |
d{3} |
[78] |
language |
::= |
'BG' | 'CS' | 'DA' | 'DE' | 'EL' | 'EN' | 'ES' | 'ET' | 'FI' | 'FR' | 'GA' | 'HR' | 'HU' | 'IS' | 'IT' | 'LT' | 'LV' | 'MK' | 'MT' | 'NL' | 'PL' | 'PT' | 'RO'| 'SK' | 'SL' | 'SV' | 'TR' |
[79] |
page |
::= |
d{4} | ('C'|'E'|'S') d{3} |
[80] |
sequence |
::= |
d{2} |
[81] |
container |
::= |
'doc' | 'frag' | 'toc' |
[82] |
type |
::= |
'frg' | 'xml' | 'tif' | 'tgz' | 'zip' |
[83] |
d |
::= |
'0' | '1' | '2' | '3' | '4' | '5' | '6' | '7' | '8' | '9' |
Vocabulary
chapter.se |
chapter number of the special edition |
container |
indicator for instance describing the composition of a document or a publication: no container: XML instance, document component doc: description of a document frag: encapsulation of fragments toc: table of contents of a volume (not a table of contents within a document!) |
d |
symbolic placeholder for digits |
language |
language of the document or the publication (a list of valid values is available within the schema) |
page |
starting page of the document; in some cases the number is preceded by a letter, which indicates that the numbering does not follow the normal pagination; the letter 'S' indicates supplement pages which are used for the cover pages of a publication or for inserted forms in the case of competitions, 'E' indicates any other inserted object, 'C' precedes the number of the cover pages of a consolidated document; in any case the length of the pagination never exceeds 4 characters |
sequence |
sequence of document on the starting page |
type |
indication of the contents format: frg: The file contains fragments of a bigger document; the content is supposed to be coded in XML (Formex), but it must not necessarily be valid nor well-formed xml: The file contains a valid XML (Formex) instance. This is why a container of type 'frag' should always have this file extension xsl: The file contains a layout description in form of a stylesheet in XSL tif: the file contains an image in TIFF format. |
volume.se |
volume number of the special edition |
year |
year of the publication of the Official Journal |
Examples:
|