This HTML5 document contains 18 embedded RDF statements represented using HTML+Microdata notation.

The embedded RDF content will be recognized by any processor of HTML5 Microdata.

Namespace Prefixes

PrefixIRI
n8http://demo.openlinksw.com/about/id/http/dragonfly.hypotheses.org/91/
wdrshttp://www.w3.org/2007/05/powder-s#
dchttp://purl.org/dc/elements/1.1/
n2https://dragonfly.hypotheses.org/
rsshttp://purl.org/rss/1.0/
n7http://demo.openlinksw.com/about/id/http/www.dragonfly.hypotheses.org/
rdfhttp://www.w3.org/1999/02/22-rdf-syntax-ns#
contenthttp://purl.org/rss/1.0/modules/content/
xsdhhttp://www.w3.org/2001/XMLSchema#

Statements

Subject Item
n2:361
rdf:type
rss:item
dc:creator
Christof Schöch
wdrs:describedby
n7:91 n8:
dc:date
2013-06-23T17:53:28Z
dc:subject
My research Balzac novel Bibliographix CATMA Zotero annotation Nice description
rss:title
Annotation, or: it doesn’t always have to be quantitative analysis
rss:link
https://dragonfly.hypotheses.org/361
rss:description
Up to now – and this means for its first year, my first post here dating from 12 months ago – this blog has been primarily concerned, with respect to text analysis, with quantitative approaches. However, this is of course only one part of computational text analysis, and computationally supported manual annotation of texts is another one. This is what...
content:encoded
<p>Up to now &#8211; and this means for its first year, <a title="The Dragonfly’s Gaze" href="http://dragonfly.hypotheses.org/1">my first post here</a> dating from 12 months ago &#8211; this blog has been primarily concerned, with respect to text analysis, with quantitative approaches. However, this is of course only one part of computational text analysis, and computationally supported manual annotation of texts is another one. This is what this post is about.</p> <div id="attachment_363" style="width: 310px" class="wp-caption alignleft"><a href="http://dragonfly.hypotheses.org/files/2013/06/catma.png"><img aria-describedby="caption-attachment-363" loading="lazy" class="size-medium wp-image-363" src="http://dragonfly.hypotheses.org/files/2013/06/catma-300x134.png" alt="CATMA - Computer Aided Textual Markup &amp; Analysis" width="300" height="134" srcset="https://f-origin.hypotheses.org/wp-content/blogs.dir/857/files/2013/06/catma-300x134.png 300w, https://f-origin.hypotheses.org/wp-content/blogs.dir/857/files/2013/06/catma-500x224.png 500w, https://f-origin.hypotheses.org/wp-content/blogs.dir/857/files/2013/06/catma.png 1165w" sizes="(max-width: 300px) 100vw, 300px" /></a><p id="caption-attachment-363" class="wp-caption-text">CATMA &#8211; Computer Aided Textual Markup &amp; Analysis (not just for girls, obviously.)</p></div> <p>To be perfectly honest, I am indeed more interested in quantitative text analysis right now than in manual annotation. But a large part of <a href="http://www.christof-schoech.de/description-double-roman">my Ph.D. thesis</a> was an exercise in computer-assissted manual annotation of literary descriptions in French Eighteenth-Century novels, and <a href="http://www.christof-schoech.de/poetiques-du-descriptif">I have recently had the occasion to go back to this technique</a> and apply it to a small number of nineteenth-century novels. The basic question was to find out what kind of relation there exists between descriptive techniques in the Enlightenment novel on the one hand, and in the realist novel on the other.<span id="more-361"></span></p> <p>Compared to my work for the Ph.D. thesis, many things were different. First of all, instead of working my way through 32 novels, here I was just dealing with two, namely with Balzac&#8217;s <em>Eugénie Grandet</em> and his <em>La Peau de Chagrin</em>. Also, instead of dealing with a variety of different issues, here I was just dealing with one very specific issue, namely the techniques of integration of descriptions into their narrative context.[<a href="https://dragonfly.hypotheses.org/361#footnote_0_361" id="identifier_0_361" class="footnote-link footnote-identifier-link" title="Relevant aspects of this are: implicit and explicit techniques; the types of explicit arguments used to justify or legitimize a decription; the types of events and circumstances used to implicitly naturalize or motivate a description; the position and intensity of such events and circumstances; the object of the description; the narrative form of the context; etc.">1</a>] And, most importantly, I was using the typology of such techniques which was the result of a long process in the thesis and simply &#8220;applied&#8221; it to two more novels. Of course, part of the aim was to see how well such a typology, developed for the eighteenth century, would work for the nineteenth.</p> <p>The last difference is that, while I was working with <a href="http://home.mybibliographix.com/">Bibliographix</a> during my thesis (a clear case of useful tool abuse), I am now not working with Windows any more and therefore need another solution. Two came to mind: <a href="http://www.catma.de">CATMA</a>, developed at the University of Hamburg, on the one hand, and <a href="http://zotero.org">Zotero</a> developed by the Roy Rosenzweig Center for History and New Media. Both are web-based[<a href="https://dragonfly.hypotheses.org/361#footnote_1_361" id="identifier_1_361" class="footnote-link footnote-identifier-link" title="Zotero was born as a web-based solution and later developed a Firefox plugin and standalone version, while CATMA used to be a desktop application but is now, i.e. since version 4 released in April 2013 web-based.">2</a>] and therefore platform-independent, which is nice. So I tried out both of them!</p> <p>I started with CATMA, the textual markup and analysis tool. What appealed to me in CATMA was the possibility to identify very specific portions of text to add tags to them, and the possibility to do nicely complex queries on the text. Also, CATMA is really built for my use case. So, I loaded my two novels into CATMA, then reading the texts, identifying descriptions, and categorizing the descriptions as a whole and smaller parts of them according to my now well-established categories. It sounds easy and really is not very complicated, but there are some not so intuitive details you need to take care of. Tag libraries need to be established before you start tagging, and they need to be loaded and activated for each document you want to tag, and the web-based interface is at times a bit slow to react; I had to cut my novels into several pieces to work comfortably with them. And the same goes for the query functions &#8211; very poweful but a bit clunky to use.</p> <p>After going through all of my novels with CATMA, and wanting to do quick searches for certain combinations of features and see the resulting descriptions, I decided to try Zotero as an alternative approach. I know Zotero well for using it as a bibliographic database in various research contexts, both personal (Bibliography on <a href="http://www.zotero.org/groups/literary_description_-_a_research_bibliography">Literary Description</a>) and official (DARIAH Bibliography on <a href="http://home.mybibliographix.com/">Doing Digital Humanities</a>), so this was an easy choice. And I had already identified all of the descriptions in my two novels and was able to easily grab them from CATMA.</p> <p>So I took the descriptions from CATMA and entered them into a fresh Zotero collection one by one, adding each individual description as a new item and putting the text into a &#8220;note&#8221; (that took a while). Then, I went through all of them, marked relevant passages with different colors and added tags for all kinds of phenomena to each entry. The upside of Zotero was that it is easy to create tags as you go along, and that is quite snappy when you use the Firefox plugin or the standalone version (the purely web-based version is also a bit slow and limited). The downside is that it is not possible to add tags on a textual level, but only on the item level. Also, working in this way,  I did not have the full text of the novel at my direct disposal, so there is a certain effect  of de-contextualisation.</p> <p>So, what is my personal bottom line? Zotero is simple and snappy and I know it well, but it is not really designed for my use case. CATMA, on the other hand, is designed exactly for my needs with this little research project, but I did not have the patience to stick with it. Once some of the rough edges get smoothed out, however, CATMA is undoubtedly the more adequate and more powerful tool.</p> <p>And what was the research result? The typology developed for the eighteenth century worked well on Balzac, but some interesting differences showed up. For example, the ninetheenth century novel has a reputation (established by Philippe Hamon) for having a preference for symmetrical implicit integration techniques (such as a sequence like: opening of a door &#8211; description of person appearing in the doorframe  &#8211; closing of the door). Such symetrical sequences are quite rare in the eighteenth century, where very simple configurations dominate and complex ones are almost always asymetrical. And it turns out, these symetrical configurations are also quite rare in the two Balzac novels I studied. Whether this is true more generally would need to be decided on the basis of a much larger sample. And indeed, I would argue, this would need to be done in an at least partly automated manner, i.e., in a way that combines qualitative analysis and quantitative techniques, possibly through a combination of rule-based annotation and machine-learning.</p> <p>In other words, it doesn&#8217;t matter whether it&#8217;s manual or automatic, qualitative or quantitative, as long as it is computational. ;-)</p> Notes<ol class="footnotes"><li id="footnote_0_361" class="footnote">Relevant aspects of this are: implicit and explicit techniques; the types of explicit arguments used to justify or legitimize a decription; the types of events and circumstances used to implicitly naturalize or motivate a description; the position and intensity of such events and circumstances; the object of the description; the narrative form of the context; etc.</li><li id="footnote_1_361" class="footnote">Zotero was born as a web-based solution and later developed a Firefox plugin and standalone version, while CATMA used to be a desktop application but is now, i.e. since version 4 released in April 2013 web-based.</li></ol>