Document metadata.json format #78
Comments
Hey Peter! There should be no need to generate your own metadata.json file if you use the included XML parser, since that'll generate the file for you. In fact, that is the default behavior (not assuming philo4). Generating your own metadata.json file is certainly doable, but I've never done it separately from the existing code base. Do you have a particular use case in mind? Now, I certainly take your point about documentation... I was hoping to find time for this lately, but sadly that has not happened. I should be making a new development push in the fall, so I'll definitely clear this up then. I'm going to leave this open since documentation reminders can't hurt... :-)
Hi Clovis, I think I'm having similar issues to what Peter was dealing with. Basically, TextPAIR's XML parser doesn't recognize the metadata in my target XML files. It does recognize the metadata in my source XML files, however, and as far as I can tell the document structures are the same. Maybe it would help to know which fields TextPAIR looks for when it searches for metadata? I'll paste a precis of one of the problematic XML files at the bottom of this comment, in case the issue leaps out at you. Thanks for your help; I hope you don't mind my making this support request here. I figured that someone else struggling with the same problem might end up in the same place.
Hi James, Hmm.. odd... the file you sent looks good to me... Which metadata were you hoping to get picked up by the parser?
Basically this stuff, which I converted to JSON manually to generate a metadata.json file (which also didn't work):
Thanks for getting back to me so quickly, though.
OK... could you possibly email me one source and one target file so I can debug things on my end? Thanks!
Sent! Thanks again.
Hi James, Apologies for not getting back to you sooner! So the TEI parser built into TextPAIR can only extract document-level metadata. And even that is somewhat restricted, since only a couple of XPaths are defined for each field (and even those are restricted). Supporting lower-level metadata parsing (and metadata parsing configurability) would require some non-trivial changes. The alternative (and the reason why this has never happened) is to load the texts under PhiloLogic (which allows you to specify, within some limits, the metadata fields you want to store) and use the PhiloLogic index to retrieve the metadata you want. So if you want ACT or SCENE metadata information for your alignments, you would want to go through PhiloLogic. Another upside of leveraging PhiloLogic is that you can link your alignments to PhiloLogic to get back to the context of the alignment. Hope that helps, and let me know if you need any pointers. C
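For anyone landing here later, a rough sketch of what "document-level metadata from a couple of XPaths per field" can look like in practice. This is not TextPAIR's actual parser: the `FIELD_XPATHS` table, the `extract_doc_metadata` helper, the sample file, and the assumption that the files use the standard TEI namespace are all hypothetical, chosen only to illustrate the idea with Python's standard library.

```python
import xml.etree.ElementTree as ET

# Assumption: the files declare the standard TEI namespace.
TEI_NS = {"tei": "http://www.tei-c.org/ns/1.0"}

# Hypothetical field -> candidate XPaths, tried in order.
# These are NOT the XPaths TextPAIR itself uses.
FIELD_XPATHS = {
    "title": [
        ".//tei:teiHeader//tei:titleStmt/tei:title",
        ".//tei:teiHeader//tei:sourceDesc//tei:title",
    ],
    "author": [
        ".//tei:teiHeader//tei:titleStmt/tei:author",
        ".//tei:teiHeader//tei:sourceDesc//tei:author",
    ],
}


def extract_doc_metadata(xml_text):
    """Return one flat dict of document-level fields found in the TEI header."""
    root = ET.fromstring(xml_text)
    metadata = {}
    for field, xpaths in FIELD_XPATHS.items():
        for xpath in xpaths:
            node = root.find(xpath, TEI_NS)
            if node is None:
                continue
            text = "".join(node.itertext()).strip()
            if text:
                metadata[field] = text
                break  # first matching XPath wins
    return metadata


# Made-up sample document for illustration only.
SAMPLE = """<TEI xmlns="http://www.tei-c.org/ns/1.0">
  <teiHeader><fileDesc><titleStmt>
    <title>Hamlet</title><author>William Shakespeare</author>
  </titleStmt></fileDesc></teiHeader>
  <text><body>
    <div type="act" n="1"><div type="scene" n="1"><p>Who's there?</p></div></div>
  </body></text>
</TEI>"""

if __name__ == "__main__":
    print(extract_doc_metadata(SAMPLE))
```

Note that the act and scene `div` elements in the sample body are never looked at: a header-only lookup like this has no way to attach ACT or SCENE values to a passage, which is why that kind of metadata needs the PhiloLogic route described above. A file whose header fields sit at paths not covered by the table would likewise come back empty.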
Just a quick thought -- having a minimal example of metadata.json could help folks who aren't in a Philo4 environment load in their texts...