Benutzer: Gast  Login
Mehr Felder
Einfache Suche
Verantwortlich:
Gebendorfer, Christoph; Elnaggar, Ahmed
Autorinnen / Autoren:
Gebendorfer, Christoph; Elnaggar, Ahmed
Institutionszugehörigkeit:
TUM
Herausgeber:
TUM
Titel:
Legal DCEP - Translation Corpus
Identifikator:
doi:10.14459/2018md1446648
Enddatum der Datenerzeugung:
30.01.2018
Fachgebiet:
DAT Datenverarbeitung, Informatik
zusätzliche Fachgebiete:
Legal Domain
Quellen der Daten:
Textdokumente / text documents
Datentyp:
Texte / texts
Methode der Datenerhebung:
Derivation of the DCEP corpus
Beschreibung:
The Legal DCEP is a derivation of the original Digital Corpus of the European Parliament. The Digital Corpus of the European Parliament (DCEP) is a data collection which contains descriptive legal texts (Agendas of plenary sessions, Parliamentary News, Press Releases, Motions for Resolutions, Plenary Sitting Protocols, Reports of the Parliamentary Comittees, Rules of Procedure of the European Parliament, Final Texts of Plenary Votes, Written Questions) published by the Joint Research Centre (JRC...     »
Links:

Chair:

https://wwwmatthes.in.tum.de/pages/t5ma0jrv6q7k/sebis-Public-Website-Home

Used in order to train a deep learning translation model in the legal domain:

https://wwwmatthes.in.tum.de/pages/s4orjknmqls4/Master-s-Thesis-Christoph-Gebendorfer

Original Corpus:

https://ec.europa.eu/jrc/en/language-technologies/dcep

Schlagworte:
legal-dcep; parallel texts from the European Parliament; DCEP documents
Technische Hinweise:
Moses/Giza++ Format
View and download (4.4 GB, 23 files)
The data server also offers downloads with FTP
The data server also offers downloads with rsync (password m1446648):
rsync rsync://m1446648@dataserv.ub.tum.de/m1446648/
Sprache:
de
Rechte:
by, http://creativecommons.org/licenses/by/4.0
Andere Rechte:
Rights implied by original corpus (DCEP) - property of the European Parliament
 BibTeX