User: Guest  Login
Responsible:
Gebendorfer, Christoph; Elnaggar, Ahmed
Authors:
Gebendorfer, Christoph; Elnaggar, Ahmed
Author affiliation:
TUM
Publisher:
TUM
Title:
Legal DCEP - Translation Corpus
Identifier:
doi:10.14459/2018md1446648
End date of data production:
30.01.2018
Subject area:
DAT Datenverarbeitung, Informatik
Other subject areas:
Legal Domain
Resource type:
Textdokumente / text documents
Data type:
Texte / texts
Description:
The Legal DCEP is a derivation of the original Digital Corpus of the European Parliament. The Digital Corpus of the European Parliament (DCEP) is a data collection which contains descriptive legal texts (Agendas of plenary sessions, Parliamentary News, Press Releases, Motions for Resolutions, Plenary Sitting Protocols, Reports of the Parliamentary Comittees, Rules of Procedure of the European Parliament, Final Texts of Plenary Votes, Written Questions) published by the Joint Research Centre (JRC...     »
Method of data assessment:
Derivation of the DCEP corpus
Links:

Chair:

https://wwwmatthes.in.tum.de/pages/t5ma0jrv6q7k/sebis-Public-Website-Home

Used in order to train a deep learning translation model in the legal domain:

https://wwwmatthes.in.tum.de/pages/s4orjknmqls4/Master-s-Thesis-Christoph-Gebendorfer

Original Corpus:

https://ec.europa.eu/jrc/en/language-technologies/dcep

Key words:
legal-dcep; parallel texts from the European Parliament; DCEP documents
Technical remarks:
Moses/Giza++ Format
View and download (4.4 GB, 23 files)
The data server also offers downloads with FTP
The data server also offers downloads with rsync (password m1446648):
rsync rsync://m1446648@dataserv.ub.tum.de/m1446648/
Language:
de
Rights:
by, http://creativecommons.org/licenses/by/4.0
Other rights:
Rights implied by original corpus (DCEP) - property of the European Parliament
 BibTeX