-
Notifications
You must be signed in to change notification settings - Fork 87
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Standoff: annotation microstructure #1745
Comments
Web Annotation Data Model (WADM)
Web Annotation Principles The Web Annotation Data Model is defined using the following basic principles:
Annotation model
Example Use Case: Alice has written a post that makes a comment about a particular web page. Her client creates an Annotation with the post as the body resource, and the web page as the target resource. {
"@context": "http://www.w3.org/ns/anno.jsonld",
"id": "http://example.org/anno1",
"type": "Annotation",
"body": "http://example.org/post1",
"target": "http://example.com/page1"
} |
More complex annotation models Real life situation require mechanisms to cope with other factors:
|
An attempt to nest TEI markup inside OA mapped to XML (from earlyPrint project) <annotation-list>
<annotation-item id="A77567_04-000340" creator="earlyPrint" visibility="public" generator="import"
status="accepted" created="2016-01-01">
<annotation-body type="TEI" subtype="regularization" format="text/xml">
<orig>Pauls</orig>
<reg>Paul's</reg>
</annotation-body>
<annotation-target source="A7e0c97f9-60a1-43d7-9b0e-7f4ec7ad39ac" version="1">
<target-selector type="IdSelector" value="A77567_04-000340"/>
</annotation-target>
</annotation-item>
<annotation-item generator="earlyPrint" id="Ad93fcf5b-7174-4883-9934-06209eaa80ee" status="pending" visibility="public" creator="shcuser" created="2016-07-05T19:38:09.944Z" modified="2016-07-05T19:38:09.944Z" class="style-scope annotation-list">
<annotation-body subtype="update" type="TEI" format="text/xml" class="style-scope annotation-list">
<w class="style-scope annotation-list">Widower</w>
</annotation-body>
<annotation-target source="A7e0c97f9-60a1-43d7-9b0e-7f4ec7ad39ac" class="style-scope annotation-list">
<target-selector type="IdSelector" value="A77567_04-000640" class="style-scope annotation-list"/>
</annotation-target>
</annotation-item>
</annotation-list> |
Shouldn't the |
Just a note: the principles worked out in stdfSpec have now been put to use in several projects. They are not at odds with the WADM and are compatible with ISO proposals. I trust that these principles will form the core of whatever is being proposed here. |
Another example from a standoff proposal of ~2015, never really used in practice. Idea was to use TEI markup enhanced with @stf_target or @stf_from/@stf_to anchoring it to base text to encode different layers (each a valid OHCO) <standoff>
<stf xml:id= “stf_name”>
<persName stf_target="#w119" ref="#Morgain"/>
<persName stf_target="#w132" ref="#Lancelot"/>
<persName stf_target="#w320" ref="#Yvain"/>
<persName stf_from="#w323" stf_to="#w325" ref="#DukeClarence"/>
</stf>
<stf xml:id= “stf_hi”>
<hi stf_from="#w1" stf_to="#w10" rend="rubric"/>
</stf>
</standoff> |
Like alluded to by @bansp , we already have quite a couple of project implementing the current proposal. You have a NER scenario under https://github.com/laurentromary/stdfSpec/tree/AnnArbor/Scenarios . In particular, it would be good to keep to the naming agreed at the Ann Arbor council meeting (standOff, listAnnotation, annotationBlock). We should focus on defining a stable content model (see the analysis at the end of the scenario document). |
Standoff representation of apparatus variorum in Digital Mishnah project <app xml:id="app.4.1.1.2.5">
<rdgGrp n="1">
<rdg wit="#P00001">
<ptr target="#P00001.4.1.1.2.5"/>
</rdg>
<rdg wit="#P00002">
<ptr target="#P00002.4.1.1.2.5"/>
</rdg>
<rdg wit="#S01520">
<ptr target="#S01520.4.1.1.2.5"/>
</rdg>
<rdg wit="#S07106">
<ptr target="#S07106.4.1.1.2.5"/>
</rdg>
<rdg wit="#S07204">
<ptr target="#S07204.4.1.1.2.4"/>
</rdg>
<rdg wit="#S07319">
<ptr target="#S07319.4.1.1.2.6"/>
</rdg>
<rdg wit="#S07326">
<ptr target="#S07326.4.1.1.2.5"/>
</rdg>
<rdg wit="#S08174">
<ptr target="#S08174.4.1.1.2.5"/>
</rdg>
</rdgGrp>
<rdgGrp n="empty">
<rdg wit="#S00483"/>
</rdgGrp>
</app> |
@laurentromary @bansp I was charged by the TEI Council yesterday with gathering existing standoff approaches for further discussion. I would appreciate having examples from the projects you mention here, ideally making it clear how they are WADM conformant. |
This is what you will find under https://github.com/laurentromary/stdfSpec/tree/AnnArbor/Scenarios |
I was reviewing this issue within context with #1833 and accidentally managed to close it—sorry! It is reopened now. I do agree with @sydb that the use case @joeytakeda is providing on that ticket relate to the examples being reviewed here as stand-off annotation. |
Status update: we have a workgroup on standoff that involves @laurentromary and Council representatives. Specific discussion is taking place in standoff proposal issue tracker https://github.com/laurentromary/stdfSpec/issues. Currently we are working on creating good examples for several specific use cases we have defined and intend to work on elaborating standoff and listAnnotation content models incrementally. |
As this discussion is motivated via Web Annotation and it seems the discussion will converge to yet another TEI insular solution: What is the objective for not following W3C recommendations regarding embedding Web Annotation in markup languages? See https://www.w3.org/TR/annotation-html/. |
related to #374
History
Subject of standoff is discussed in this tracker since 2012 and must have surfaced much earlier in unrecorded conversations. Initial agreement was of a
<standoff>
element envisioned as a sibling of the<teiHeader>
and<text>
which would serve as a wrapper for individual annotations. Convincing statement about what is meant by annotation is given in #374 (comment) and Council meeting of 2012-09 agreed with that, without determining exact content model, except postulatingFurther work on standoff has been delegated to workgroups and LingSiG but until now we do not have any conclusion. Meanwhile other attempts at standardization of annotations have been carried out by Open Annotation Community group, later resulting in W3C specification of Web Annotation Data Model
https://www.w3.org/TR/annotation-model. Council agrees that TEI annotation model should be compatible with OA recommendations.
Current standoff proposal can be found at http://htmlpreview.github.io/?https://github.com/laurentromary/stdfSpec/blob/AnnArbor/Scenarios/StandOffScenarios.html
The text was updated successfully, but these errors were encountered: