-
Notifications
You must be signed in to change notification settings - Fork 22
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Did you use other scripts beside Latin in MLT2017 for pre-training? #13
Comments
Our model only works with Latin scripts. You may refer to #12 if you want to extend the character set. |
@zx1239856 Thanks a lot! And I found the coco format data has column "polys" with 16 numbers (8 points?) for each, how did you convert ICDAR dataset that only have bounding box with four points to that? |
The 8 points "polys" are control points of Bezier curves. For polygon annotations, we use 16 points. We reuse this column name for both types of annotations. Please refer to https://github.com/mlpc-ucsd/TESTR/blob/main/adet/data/builtin.py#L19-L47. For example, In terms of conversion from quadrilateral boxes to 16-point polygons, I assume you may add 3 more points to each side of the box. I've also seen other approaches based on the division of circumference, i.e. you divide the perimeter into 13 segments evenly and therefore 12 points can be added. |
Thanks! So your illustration is exactly the same with your method of conversion? (divide the perimeter into 13 segments evenly and therefore 12 points can be added) |
We simply adopted the first approach I mentioned above i.e. inserting 3 more points on each side. |
I got it, thank you so much:) |
Did you use other scripts beside Latin in MLT2017 for pre-training?
The text was updated successfully, but these errors were encountered: