-
Notifications
You must be signed in to change notification settings - Fork 11
The behaviour attribute value doesn't specify it's parsing. #8
Comments
When attempting to parse a |
in tei-pm, there should be a datatype for each parameter of a function. That should deal with this? XPaths are not quoted, strings are. |
For example how is a |
um. we have no idea! we don't know how we'd handle that in XSLT. |
So are strings assumed to be XML encoded, so a string of |
That doesn't help you, because the XML parser expands the entities into On 24 March 2015 at 15:59, Matthew Buckett [email protected] wrote:
Sebastian Rahtz Director (Research) of Academic IT University of Oxford IT Services 13 Banbury Road, Oxford OX2 6NN. Phone +44 1865 283431 Não sou nada. Nunca serei nada. Não posso querer ser nada. À parte isso, tenho em mim todos os sonhos do mundo. |
This came about because an XPath expression may contain a comma (I think) so I was thinking about how to parse the function to extract out the 2 XPath expressions for |
ah. I see where you are going how. and I just met a similar problem. I am beginning to think we should change this spec to say that the XPath expression should be passed as a string, i.e. surrounded by quotes. Doesn't help with how to pass quotes, but does deal with the embedded comma. |
I suggest elements should be used instead of attributes (for behaviour and predicate). Otherwise I think |
Since this is XPath 2, we have the codepoints-to-string() function, but it's not pretty. "concat(codepoints-to-string(34), 'Hello', codepoints-to-string(34), ' said the policeman')" |
It's a fair point, Conal. I don't want to change horses mid-race when the problem right now is |
I've compared the TEI Simple dtd with the DTA schema. Simple is more generous than DTA, but DTA has the following elements that Simple does not allow for: addName Should we include them? I can see three different arguments in favour of doing so. First, DTA has been adopted by CLARIN as its base format. Other things being equal, there is a benefit if a text in that format validates under Simple. Second, and perhaps more substantively, named entity extraction seems to be the chief, and often the only, thing that people are interested in when they work with texts. Third, when I showed Simple to the Perseus folks, they were very interested in the processing model but objected to the exclusion of the name elements. On the minus side, you can just use type attributes for sub specification of names, and Simple may run the risk of no longer being simple. Do we want to slide down that slippery slope? |
I think we quite consciously have made the decision of excluding 'syntactic On 8 April 2015 at 15:19, martinmueller39 [email protected] wrote:
|
the naming thing is hard. we can put back all the specific ones, but then we'd have to remove the generic @type version. would that actually be better? i.e. not to support at all? the conversion stylesheet is now in the TEI Stylesheets |
There's no specification on how the
behaviour
attribute's value should parsed. How should strings, URIs and XPath expressions should be quoted.The text was updated successfully, but these errors were encountered: