Category Archives: Data Projects

Hypergraphy as a Garden of Forking Paths

In zeroing in on a specific data set as a first step toward a more fully conceived project for next Spring, I’ve found it necessary to first demarcate my chosen subject matter: to work backwards, so to speak.

The prefix “hyper” refers to multiplicity, abundance, and heterogeneity. A hypertext is more than a written text, a hypermedium is more than a single medium. – Preface to HyperCities

Hypergraphy, sometimes called hypergraphics or metagraphics: a method of mapping and graphic creation used in the mid-20th century by various Surrealist movements. The approach shares some similarities with asemic writing, a wordless, open semantic form of writing; “asemic” literally means “having no specific semantic content.” Some forms of calligraphy (think stylized Japanese ink brush work) serve a similar function, whereby the non-specificity leaves space for the reader to fill in, interpret, and deduce meaning. The viewer is suspended in a state somewhere between reading and looking. Traditionally, true asemic writing only takes place when the creator of the asemic work cannot read their own writing.

Example work:

https://en.wikipedia.org/wiki/Hypergraphy#/media/File:GrammeS_-_Ultra_Lettrist_hypergraphics.jpg

Jorge Luis Borges was an Argentine short-story writer, essayist, poet, translator, and librarian. A key figure in Spanish-language literature, he is sometimes thought of as one of the founders of magical realism. He notably lost his sight in the 1950s, decades before his death in 1986. In his blindness, he continued to dictate new works (mostly poetry) and give lectures. Themes in his work include books, imaginary libraries, the art of memory, the search for wisdom, mythological and metaphorical labyrinths, dreams, and the concepts of time and eternity. One of his stories, “The Library of Babel,” centers on a library containing every possible 410-page text. Another, “The Garden of Forking Paths,” presents the idea of forking paths through networks of time, none of which is the same, all of which are equal. Borges returns time and again to the image of “a labyrinth that folds back upon itself in infinite regression” so we “become aware of all the possible choices we might make.”^[88]

The branches of the forking paths represent these choices, which ultimately lead to different endings.

Borges is also known for the philosophical term the “Borgesian conundrum.” From Wikipedia:

The philosophical term “Borgesian conundrum” is named after him and has been defined as the ontological question of “whether the writer writes the story, or it writes him.”^[89] The original concept put forward by Borges is in Kafka and His Precursors—after reviewing works that were written before Kafka’s, Borges wrote:

“If I am not mistaken, the heterogeneous pieces I have enumerated resemble Kafka; if I am not mistaken, not all of them resemble each other. The second fact is the more significant. In each of these texts we find Kafka’s idiosyncrasy to a greater or lesser degree, but if Kafka had never written a line, we would not perceive this quality; in other words, it would not exist. The poem “Fears and Scruples” by Browning foretells Kafka’s work, but our reading of Kafka perceptibly sharpens and deflects our reading of the poem. Browning did not read it as we do now. In the critics’ vocabulary, the word ‘precursor’ is indispensable, but it should be cleansed of all connotation of polemics or rivalry. The fact is that every writer creates his own precursors. His work modifies our conception of the past, as it will modify the future.”

I’m circling around 2 or 3 different project ideas:

1. Close Reading/Qualitative Analysis: Hypertextualized Borges poems/short stories, with an emphasis on works created during his period of blindness, re-imagined as a garden of forking paths. Break down the works into levels of constituent parts. Create an engine to reassemble them based on a methodological algorithm informed by his ideas surrounding non-linearity and the morphology of his oeuvre.
1.5 *Potential Visualization Component: Hypergraphy Engine (simulated blindness) that interacts with the hypertextualized artifacts from 1.0.
2. Distant Reading/Quantitative Analysis: Topics as “forms of discourse” in Borges and his precursors (Potential Candidates: Cervantes, Kafka, Schopenhauer, Quevedo, Gracian, Pascal, Coleridge, Poe.)
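To make the first idea a little more concrete, here is a minimal sketch of what the reassembly step might look like, assuming a work has already been broken into passages and that someone has decided which passages may follow which. The `garden` dictionary, the passage names, and the `readings` function are all hypothetical placeholders for illustration; none of them come from Borges's texts or from an existing tool.

```python
def readings(garden, start):
    """Yield every path from `start` to a passage with no onward fork.

    `garden` maps a passage name to the list of passages that may follow it.
    A passage already on the current path is skipped, so a labyrinth that
    folds back on itself still produces finite readings.
    """
    stack = [[start]]
    while stack:
        path = stack.pop()
        # Only follow forks that have not been visited on this path.
        forks = [nxt for nxt in garden.get(path[-1], []) if nxt not in path]
        if not forks:
            yield path  # nowhere new to go: this reading has ended
            continue
        # Push in reverse so forks are explored in their listed order.
        for nxt in reversed(forks):
            stack.append(path + [nxt])


# Hypothetical passages; "right" deliberately loops back to "opening".
garden = {
    "opening": ["left", "right"],
    "left": ["ending_a", "ending_b"],
    "right": ["ending_b", "opening"],
}

for path in readings(garden, "opening"):
    print(" -> ".join(path))
```

Each printed line is one possible reading of the same set of passages. Choosing which forks exist, and why, is where the actual interpretive work of the project would sit; the code only walks whatever structure it is given.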
…..(Running out of time, will continue this post tonight).

Taylor

TACIT – A New Tool for Text Collection and Analysis

Thanks to Sava Saheli Singh, whose weekly round-up for the GC’s own Journal of Interactive Technology and Pedagogy brought a new tool to my attention: TACIT, the Text Analysis, Crawling and Interpretation Tool. From the website:

Though several limited-method tools for text analysis are already available (e.g. LIWC), and some have become part of standard statistical packages (e.g., SPSS Text Analytics), a unified, open-source architecture for gathering, managing and analyzing text does not exist.

The Computational Social Science Lab (CSSL) at the University of Southern California introduces TACIT: An Open-Source Text Analysis, Crawling and Interpretation Tool.
TACIT’s plugin architecture has three main components:

Crawling plugins, for automated text collection from online sources (e.g., US Senate and Supreme Court speech transcriptions, Twitter, Reddit)

Analysis plugins, including LIWC-type word count, topic modeling, sentiment analysis, clustering and classification.

Corpus management, for applying standard text preprocessing to prepare and store corpora.

TACIT’s open-source plugin platform allows the architecture to easily adapt with the rapid developments text analysis.

The tool is available on Github for those interested in checking it out. A related paper can be found on SSRN.

I have not used this tool, so if anyone here tries it out, please report back!

Apollo Moon Mission Photographs

Those of you looking for datasets to imagine might be interested to know that NASA just released scans of 8,400 Apollo Moon Mission photos on Flickr. More on this from Mother Jones.

Related: Grab it: Download photos in bulk from Flickr, Facebook (n.b. I have not tried any of the tools listed here).

Ways that Humanists Think About Data – An alternative text for in-class discussion

Up to this point, I’ve enjoyed our in-class discussions. Typically, I leave with an unfocused, impending fatigue that transforms during my subway ride home into a grounded awareness of the gaps in my thinking about DH theory, what questions I have more generally about how DH fits into the larger context of humanistic inquiry in the academy, as well as a slightly more refined awareness of how I see myself finding my place in the field.

Last week I left running through potential ideas for my data project, wishing I had asked for a more specific discussion of terms related to actual DH projects, in an effort to build a lexicon. I found myself trying to anticipate the unique ways in which humanities scholars think about data. Data sets and maps, generally, are representations of a more complex, dynamic, ambiguous world. How have DH practitioners found inspiration in this reality, and what potential solutions and tools already exist? How can the gap between the “real” and the represented be used fruitfully? How can uninterpreted data result in new ways of seeing?

After reading Stephen Ramsay’s “Programming with Humanists: Reflections on Raising an Army of Hack-Scholars in the Digital Humanities,” I found myself setting aside time to research what exactly goes into “word frequency generators” and “poetry deformers.” He mentions a list of tools for analyzing text corpora (tf-idf analyzers, basic document classifiers, sentence complexity tools, etc.), as well as natural language processing tools, as potential programs that could be built during a computer science introduction focusing on humanities computing. Hashing out a basic explanation of what these programs do, and potentially a bit about how they do it, would contribute an additional, fruitful dimension to our praxis seminar discussions. I have a sense that learning more about what tools exist would go a long way in helping me zero in on a meaningful dataset.
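As a starting point for that kind of discussion, here is a minimal sketch of two of the tools mentioned: a word frequency generator and a bare-bones tf-idf analyzer. It assumes plain lowercase-and-split tokenization and the simplest textbook tf-idf weighting (term frequency times the log of total documents over documents containing the term); real tools usually add smoothing, stop-word lists, and better tokenizers. The function names and the three toy "documents" are made up for illustration and are not from Ramsay's article.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into runs of letters and apostrophes."""
    return re.findall(r"[a-z']+", text.lower())


def word_frequencies(text):
    """Word frequency generator: count how often each word appears."""
    return Counter(tokenize(text))


def tf_idf(documents):
    """Score each term in each document.

    `documents` maps a name to its text. A term scores high when it is
    frequent in one document but rare across the collection.
    """
    counts = {name: word_frequencies(text) for name, text in documents.items()}
    n_docs = len(documents)
    # Document frequency: in how many documents does each term occur?
    doc_freq = Counter()
    for freq in counts.values():
        doc_freq.update(freq.keys())
    scores = {}
    for name, freq in counts.items():
        total = sum(freq.values())
        scores[name] = {
            term: (n / total) * math.log(n_docs / doc_freq[term])
            for term, n in freq.items()
        }
    return scores


# Toy corpus, invented for this example.
docs = {
    "a": "the library holds every book",
    "b": "the garden holds every path",
    "c": "the labyrinth of time",
}
scores = tf_idf(docs)
for term, score in sorted(scores["a"].items(), key=lambda kv: -kv[1]):
    print(f"{term:10s} {score:.3f}")
```

In this toy corpus "the" appears in every document, so its score is zero, while "library" and "book" appear only in document "a" and rise to the top. That is the basic intuition behind tf-idf: it surfaces the words that distinguish one text from the rest of the collection.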

**As an aside, as I bet not everyone will have had a chance to read this particular article, I should mention that I also really appreciated Ramsay’s extensive list of supplemental reading materials, some of which I have read (Martin Heidegger’s The Question Concerning Technology) and others that I would love to spend some time with, like, NOW (The Work of Art in the Age of Mechanical Reproduction, for example).**

During my research I came across an excellent blog post by Miriam Posner titled Humanities Data: A Necessary Contradiction, in which she engages some of the questions that are preoccupying me as I try to choose my dataset. In her post she provides a transcript of a talk she gave at the Harvard Purdue data symposium this past summer. Her talk focused on the unique ways that humanists think about data versus, say, a scientist or a social scientist, and the implications of these differences for librarianship and data curation. I’ll list a couple of pertinent quotes and a link to her post. If you have some time, check it out!

“It requires some real soul-searching about what we think data actually is and its relationship to reality itself; where is it completely inadequate, and what about the world can be broken into pieces and turned into structured data? I think that’s why digital humanities is so challenging and fun, because you’re always holding in your head this tension between the power of computation and the inadequacy of data to truly represent reality.”

“So it’s quantitative evidence that seems to show something, but it’s the scholar’s knowledge of the surrounding debates and historiography that give this data any meaning. It requires a lot of interpretive work.”

Humanities Data: A Necessary Contradiction

Cheers,

Taylor

Data Project Posts from the 2014 Praxis Class

Hi All,

If you would like to see posts that students in last year’s Praxis course made in connection with the dataset assignment, please look at the posts tagged “dataset” and “data project.”

If you’d like to look through the blog as a whole (used in both the Fall and Spring semesters), please visit http://cuny.is/dhpraxis14. And here is the 2013-2014 class archive, which included a great series of lectures. Perhaps we can talk next week about the different shapes this class has taken over the three-year period of its existence. In the first year, we brought in many guest speakers, but in response to student feedback, we have curtailed that over the past two years.

Digital Praxis Seminar Fall 2015 – Spring 2016
