-
-
Notifications
You must be signed in to change notification settings - Fork 851
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Switch skycultures to the new format #3751
base: master
Are you sure you want to change the base?
Conversation
OMG, translators will hate us for that. Back to start for everything? Review all Google translations again? Any chance to see the old tranlsations? |
The cultures in the external repo have some customized texts, so if we import them, the translations will have to change one way or another. One way to go would be to start with converting all the current cultures to the new format, and only then replace them with the ones in the external repo. But anyway, something must be done with the translations at some point—now or after the separate import, and this does imply a large review.
Yes, I expected this. The change is huge. |
Hello,
I think the old translations for object names (constellations etc..) should be more or less preserved with probably some errors (Ruslan can you confirm this?). But clearly the existing translations for the sky culture descriptions are lost. Most of the translations in the stellarium-skycultures repo were generated with google translate, and I still think auto-translation is the way to go for those long texts, but with better AI-based tools. Some tests I did showed that ChatGPT can perform remarkably well for many languages, much better than google translate (especially when passing a meaningful context in the prompt). For example I don't think I could do a better job than ChatGPT in French.
Yes, the repo already contains a documentation in the README.md. It's not enough but it's a good start. |
The regions in new format (and in Mobile and Web editions) are different in comparison to Desktop edition (or old format) - I think we should use one universal list for regions (at least for SC) for all editions of planetarium. |
They don't seem to have been copied from the original sky cultures. E.g. in Anutan original:
and new:
The lack of the dieresis in the first name and failure to capitalize the second one compared to their old versions hint that they were translated independently. Even worse, there are simply wrong translations, e.g.:
becomes
Here in the new format the plant (vegetation) is translated with its second meaning (factory), and also is sloppy grammar-wise. |
This is why all these machine translations (which of course have no context) must be marked unreviewed and reviewed (again) by a human with fitting background knowledge. This is a huge effort. Of course, the unreviewed "candidates" can go into the releases as before, to be found by all users. Should we add a "You found a suspect translation? Go to [Transifex] to help!" button to make that even more visible? (Of course also a note in the 24.3/24.4/25.1 release notes, but who reads them :-) The user translation again needs review/approval, of course. |
I think it's better to improve the context passed to ChatGPT until everything is correct in the languages we know like Russian, German and French. Then use the same context for all languages to minimize the amount of errors. Note that when I created the new format I tried to re-use the existing translations as much as I could, so I am not sure why it diverged in your examples.. |
Major SCs may have "canonical" translations in use for decades in the major languages where relevant books appear. These should be preferred (with a note like "German translations following X.Y. (1976)"!) over self-made translation dabbles or AI tools. |
Immediate reactions/ thoughts:
|
Further comments on the format
Can we find a solution for these cases to use the image in the "illustration" folder directly in the description? This concerns the following SCs:
Should we define a sort of template or "standard" ("one to rule them all" will not really work but maybe guidelline?) for the description
|
I think in our context "Western" has always predated the Iron Curtain meaning by centuries. What is commonly understood by "western" is European scholarship from the age of enlightenment but rooted in European antiquity (traditionally executed in universities and Academies of Science from Lissabon to St. Petersburg), as opposed to e.g. Islamic, Chinese, Indian, and indigenous traditions in other continents which are, in western scholarship, usually dealt with in "ethnographic studies". Still, we have agreed to rename all Western* to Modern*. |
Yes, we should use the images from the illustrations/ subfolder directly in the description. There is nothing preventing this from a technical point of view. In general in the new format I really encourage to avoid adding a section dedicated to each constellations outside the already existing ## Constellations section. The code then cross-match the content with the content of the index.json file, so it's usually not even necessary to link to the image at all.
It's already like that. The template for the markdown file has a strict structure with mandatory sections. |
Yes.. In Stellarium Mobile we didn't switch because this work predated the renaming. I am a bit worried to do that now because in practice the "Modern" name seems to be annoying some users.. I have seen angry emails.. But I guess we will also need to switch.. Hopefully we won't receive too many bad reviews.. |
Everyone I know who uses localized software expects the translations to be good—at least made by people who speak both the source and the target languages. They definitely don't think of it as "reading something in a foreign language". Moreover, many users don't even read in foreign languages well enough (or at all) to be able to cross-check anything. In my view, using an unedited machine translation is just a mark of poor quality of the product (which unfortunately applies to lots of commercial software nowadays, even those products that used to have great localizations two decades ago). Anyway, I'm now going to switch to a bit more conservative approach for this PR and convert all "old" sky cultures to the new format, so that we could handle the switch to the new ones in a separate thread, with all the problems of the translations. |
To be more precise, "Modern" are those from the 20th century and later that obey IAU constellations and borders. These are our default and some variants ("single presentations" after Rey, S&T, Hlad, others?). What did we decide on European 17-19th century atlases? (Or are they just "Hevelius", "Bayer", "Bode (1782)", "Bode (1801)" etc.?) In this respect, we could still call our default (classic Stellarium) "Default" or even "Stellarium", pointing out the originality of Johan's figure set [which has been taken over successfully outside the project] and giving us all liberties about what to include, and the others "Modern-S&T", "Modern-Rey" etc. |
your opinion! in reseach "western" is used in the recent decades by scholars west of the iron curtain (=western europe + n.america) |
hmmmm... Thinking of software: I think, you are right, that's a bit different. we expect the translation to be good enough that we don't need to understand the technology before reading the text that explains it (which makes the text useless). |
So, is Western Physics much different from Physics researched in Beijing? |
in my childhood, we called it "modern physics"/ "modern science" and not "western": that's what I am saying. if you want to politically frame a term (which was done in this time), you need to find differnt terms for things that have nothing to do with the negatively framed terms: like science. China has confuzianism in addition to modern physics. |
yes, I hope so, too... maybe point them to me in this case. In the 1990s we (east-germans) have undergone a linguistic re-education: suddenly, many terms were used differently and some terms were "forbidden" or meant sth. else ... as this influenced me rather deeply, I think a lot about the terms. I certainly do not want to 'always go back' but in contrast, I am embracing change. However, I think, in some cases the "newer" version does not really make sense. In case of the "western", I have the impression that it is both, a) too politically charged in whatever direction ('good' for one is 'bad' for others) and b) sometimes really confusing (because, e.g.. depending on the context "western" means different things: sometimes, I really have to think about the meaning of a sentence). |
Sure, you call that my opinion. But I feel I am not alone. The rest of the world still uses and understands the term "Western Science" without problems. Quick example: https://en.wikipedia.org/wiki/The_Beginnings_of_Western_Science This is fully non-political. Sorry, but maybe it was your childhood experience that was politicized by the powers around you then, when everything from the "West", even the European science tradition, had to be presented in a bad light or needed a new name in the GDR. But even the Soviet A-bomb is based on "Western" 20th century physics. (Not only thanks to Klaus Fuchs. The physics behind it was discovered in the European physics tradition of science, in North America, while in Germany a non-Einsteinian "German Physics" was tried and failed instead. There is probably just one unpolitical way nature behaves, and our scientific understanding (call it European, Western, Modern or what you want) seems to provide the best model, despite shortcomings). The political East/West separation is a post-1945 (no "Eastern Block" before that) thingy that we had all hoped to have overcome in 1991. Before that there was of course the Christian East/West divide which had a strong influence in traditions and beliefs, but royal courts were closely related from UK to Russia, which of course was also an imperialistic monarchy by undisputed Grace of God that tried its best to be European (Western). I cannot say whether "East" was then not rather understood as "oriental, Ottoman" etc. OK, we have gone largely off-topic, and I would stop here. Above, I had suggested possibly renaming our own default "Modern" SC into "Stellarium" (to give us all liberties on style and displayed objects), and use Modern-* for those IAU-constellation aware SCs where traces of Western-* naming may still be found. I did not suggest renaming anything back to Western-* because of your expected opposition, although almost everybody was OK with that name. |
The term "Oriental" also depends from context: sometimes it is China, sometimes it West Asia. that is why there are terms like "Near East", "Middle East" and "Far East" which don't make sense.... "east" and "west" are defined by Aristotle as directions (since more than 2000 years clear). The sense comes in when you define the vertex where the vector starts. ... I really have more important things to do. Let's just happily disagree ... we will never have a consensus here. |
This pull request has conflicts, please resolve those before we can evaluate the pull request. |
7d97572
to
2b7be2f
Compare
Conflicts have been resolved. A maintainer will review the pull request shortly. |
I have at least scanned through the code a bit, but did not check every line. Apart from the mentioned stuff, looks good to me, but I will play through the options and existing SCs some more hours. So I think if @alex-w and @sushoff agree we could dare to go the bold step forward! Of course some more work is then ahead of us :-) Now I wanted to build the converter and hopefully convert one SC I had created (sorry, non-free images until 2033 :-(. Hopefully I am just missing some header, but I get errors: Ah, Now I see in DescriptionOldLoader.cpp: Also, why Qt5.15? Would Qt6 not be more future-proof? Or do you plan to remove it end-of-2025? |
This comment was marked as resolved.
This comment was marked as resolved.
This comment was marked as resolved.
This comment was marked as resolved.
This comment was marked as resolved.
This comment was marked as resolved.
This comment was marked as resolved.
This comment was marked as resolved.
This comment was marked as resolved.
This comment was marked as resolved.
This comment was marked as resolved.
This comment was marked as resolved.
This comment was marked as resolved.
This comment was marked as resolved.
This comment was marked as resolved.
This comment was marked as resolved.
I see a first problem parsing a constellation_names.eng.fab file when it does not end with a newline. The converter reports a problem, so it's at least easily found. Maybe also in other files? When the h1 header name in description (the long title) is different from the directory name (skycultureID), converter reports (with a typo marked)
In the json, the id field remains empty. Should not that be the name of the directory (or skycultureID)? In this particular case the second-language text (non-Latin) caused a flood of "libgettextpo: :0:0: invalid multibyte sequence" errors. Not sure what to do about them, or whether I should expect a likewise translated second version of description.md to appear? The po file in the po directory has text in english with many (but not all) lines ending in "\r\n". (This appears where the original description.en.utf8 had line endings.) If the other-language strings should also appear here, it did not work out, but maybe because of that multibyte issue. |
It should be what you marked as XXX. I'll need to see how you got the empty field. Maybe some minimal example, e.g. a very simple description file and an
Is it actually encoded in UTF-8? Does it have a BOM (it shouldn't)? Is the original text all in English? |
The parser was adapted from the old code in |
Oh, actually no, it should be the directory name. Anyway, I need a reproducer to debug this. |
This set of commits switches Stellarium to the new format of sky cultures used in stellarium-skycultures repo.
The old format is no longer supported, but a tool is provided (
util/skyculture-converter
) that helps convert an old culture to the new one (with a limited support for conversion of the description, mostly retaining HTML and only changing the heading structure to more or less follow the spec of the new format).The sky cultures from the sky cultures repo are imported using a script,
skycultures/update-skycultures.py
.Among the structural changes to this repo are:
skycultures/common_dso_names.fab
andskycultures/common_star_names.fab
now contain the common names that used to reside inmodern_iau
culture.po/stellarium-skycultures
now keeps translations of culture-specific names, while the common names are translated inpo/stellarium-sky
.po/stellarium-skycultures-descriptions
..po
entry per section.modern
culture that I converted to the new format and pushed into that repo, for compatibility with the Stellarium default.