How to prepare data #3

akanshajainn · 2018-09-28T09:39:24Z

This repo provides the data used to train the MT system as a zipped archive. I am planning to try it on a different set of languages, but I am really stuck on how to prepare the data for that. I know how to tokenise and binarize the data, but I don't know how to obtain the dictionaries and the first translation data.

guillaumekln · 2018-10-04T07:58:31Z

As described in the README, the first translation data were generated by this project: https://github.com/jsenellart/papers/tree/master/WordTranslationWithoutParallelData

You should also be able to use the official project: https://github.com/facebookresearch/MUSE
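
For illustration, here is a minimal sketch of how a first translation could be produced once a dictionary is available. It assumes the first translation is a plain word-by-word substitution and that the dictionary is a text file with one whitespace-separated `source target` pair per line; both are assumptions, so check the README and the output of the projects above before relying on them. The function names `load_dictionary` and `translate_line` are hypothetical and not part of either project.

```python
from typing import Dict, Iterable


def load_dictionary(lines: Iterable[str]) -> Dict[str, str]:
    """Build a source->target map, keeping the first translation seen per word."""
    table: Dict[str, str] = {}
    for line in lines:
        parts = line.split()
        if len(parts) != 2:
            continue  # skip blank or malformed entries
        src, tgt = parts
        table.setdefault(src, tgt)  # assumption: the first entry is the preferred one
    return table


def translate_line(line: str, table: Dict[str, str]) -> str:
    """Replace each token with its dictionary entry; copy unknown tokens unchanged."""
    return " ".join(table.get(tok, tok) for tok in line.split())


if __name__ == "__main__":
    # Toy in-memory dictionary; in practice, read the lines from the dictionary file.
    entries = ["the le", "the la", "cat chat", "sleeps dort"]
    table = load_dictionary(entries)
    print(translate_line("the cat sleeps here", table))
```

The input text should be tokenised the same way as the dictionary entries, otherwise most tokens will miss the lookup and be copied through untranslated.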
