Welcome to our blog!

We are happy to announce that we have finally published an Open Repository for Machine Translation DataSheets! Check it out!

Here is a list of all the posts currently available:

  • PHASE #1: Machine Translation
    To start our project, we needed to do some preliminary research. We studied the corpora we will be working with (Phase 1), read articles on Machine Translation (Phases 2 & 3), and read about the biases present in it (Phases 4, 5 & 6).
  • PHASE #2: Characterization and documentation of translation datasets
    We designed a general Datasheet structure for describing a corpus that was NOT created by us (Phase 1) and that is related to Machine Translation (Phase 2).
  • PHASE #3: Datasheets for Machine Translation
    In this section we offer a Datasheet template aligned with Machine Translation (Phase 1) and provide the first two examples for the community (Phase 2).
  • Activities!
    In this section we provide the activities we prepared for our session!
  • Conclusions
    In this section we explain the conclusions we drew from the activities we organized in the Algorithmic Research Session.
  • PHASE #4: The End
    After concluding the whole process, we decided to add some new questions to the Datasheet that could be of interest to readers.