The University of Jordan :: Research Groups :: Using Technology to Compile an English-Arabic...
Featured Publications

Using Technology to Compile an English-Arabic Glossary of the most Frequent Collocations in Ted Talks Parallel Corpus

​Translation has evolved over the years and is now benefiting from the progress made in the  field  of  technology  and  corpus  linguistics.  This  study  aims  to  use  technology, particularly AntConc toolkit, to compile an English-Arabic glossary of the most frequent collocations  listed  in  TED  Talks  parallel  corpus.  The  glossary  contains  1,000  unique English  headwords  and  a  total  of  3,670  English  collocations  along  with  their  Arabic translations as produced by Ted Talks translators and the researchers. Headwords in the glossary  are  alphabetically  ordered  and  each  collocation  is  accompanied  with  its frequency and its Arabic translation. The paper demonstrates the process through which the bilingual English-Arabic  glossary has been compiled and the steps taken to process the collected data​​