Like Penelope, the poet entertains the illusion, if only momentarily, of a choice between bringing a creative work into form or allowing it to come undone.

A weaver of language, Graham subtly, deftly, but unsuccessfully attempts to delay the inevitable moment in poetic creation in which complexity of thought adopts form through language, and so realized is also reduced.

However, the necessarily reductive methodology of sorting poetic language into relatively stable categories, as topic modeling suggests, yields precisely the kind of results that literary scholars might hope for - uses of language that, having taken form, are at the same moment at odds with the laws of their creation.

In the following article, I suggest that topic modeling poetry works, in part, because of its failures. Witmore explains that what makes a text a text - its susceptibility to varying levels of address - is a feature of book culture and the flexibility of the textual imagination. We address ourselves to this level, this work, and think about its relation to some other.

Topic modeling with LDA first captured my attention as a possible way to ask discovery-oriented questions about a genre of poetry called ekphrasis - poems written to, for, or about the visual arts.

Contemporary critical models of ekphrasis define the genre through the identification of recurring tropes invoked by poets confronted by the differences between linguistic and visual media. Drawing from a longstanding tradition of competition between poets and painters and between verbal and visual arts, our most recognized critical model for ekphrasis turns on an axis of difference, otherness, hostility, and competition.

LDA, then, offered an attractive alternative for asking questions about the ekphrastic tradition for two reasons.

First, as a computational method, it allowed me to cast a much wider net: rather than selecting from just a few poems, I could consider as many as 4,500.

Second, both LDA and our existing model of ekphrasis assume that latent patterns of language, when discovered, can be used to describe the corpus as a whole.

Therefore, the rationale for deploying LDA as a method of discovery and as a means of understanding the contents of large corpora of texts begins with a similar set of assumptions. For example, LDA assumes that text documents in large corpora tend to draw from categories of language that are associated with the subjects of those documents.

The process is not unlike the critical assumptions made about ekphrasis - that it draws repeatedly from the same tropes and conventions.

I will return to this example throughout the article to illustrate how highly figurative texts such as poetry respond to LDA differently than texts that strive for more literal meaning. Many of your neighbors rave about the quality of the produce at a particular market, but you would like to know what kinds of produce are available before you decide to drive across town to try it out.

One Saturday morning, your neighbors leave for the market with empty baskets and return with full baskets. Since it happens to be late summer in our fictional story, your neighbors select from 10 types of produce that are available at the market: early Gala and Granny Smith apples, butternut squash, Bosc pears, and one neighbor even snatches up the last pint of blueberries.

One by one, as your neighbors return, you survey the contents of their baskets. Examining more and more baskets and revising your predictions, you reconsider how to sort the 10 produce types, based on which produce appears together in a basket most frequently.

As more neighbors arrive with baskets to examine, you can refine your predictions about what the full selection of produce at the market must have been. Each author chooses to varying degrees how much of each kind of topic they use for each document; however, the total number of available topics, just like the total number of kinds of produce, remains constant.
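The idea that every document draws on the same fixed set of topics, but in its own proportions, can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the two topics, their word lists, and the proportions are hypothetical, and the function name `generate_document` is invented for this sketch rather than taken from any library.

```python
import random

def generate_document(topics, proportions, length, rng):
    """Draw `length` words: pick a topic by the document's own
    proportions, then pick a word from that topic's distribution."""
    names = list(topics)
    topic_weights = [proportions[name] for name in names]
    words = []
    for _ in range(length):
        topic = rng.choices(names, weights=topic_weights, k=1)[0]
        vocab = list(topics[topic])
        word_weights = [topics[topic][w] for w in vocab]
        words.append(rng.choices(vocab, weights=word_weights, k=1)[0])
    return words

# Hypothetical topics (assumption): each is a distribution over words.
TOPICS = {
    "stillness": {"still": 0.5, "silent": 0.3, "frozen": 0.2},
    "looking": {"gaze": 0.4, "eye": 0.4, "frame": 0.2},
}

rng = random.Random(0)
# Two "authors" mix the same two topics in different proportions.
doc_a = generate_document(TOPICS, {"stillness": 0.9, "looking": 0.1}, 20, rng)
doc_b = generate_document(TOPICS, {"stillness": 0.2, "looking": 0.8}, 20, rng)
```

The number of topics never changes between documents; only the mixing proportions do, which is the constraint the market analogy describes.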

While this constraint, the assumption that all the words in a corpus could be derived from a limited set of topics, strikes the human reader as an artificial limitation, it is a necessary constraint in order for LDA to work. LDA attempts to describe the distribution of topics in a collection of texts in the same way that you predict the types and quantities of produce at the market.
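To make the fixed-number-of-topics constraint concrete, here is a minimal sketch of collapsed Gibbs sampling, one common way of fitting LDA. The article does not say which inference method or software was used, so this is a stand-in for illustration only. The function name `fit_lda`, the hyperparameter values `alpha` and `beta`, and the toy baskets are all assumptions. Note that `num_topics` must be supplied before any sorting begins.

```python
import random

def fit_lda(docs, num_topics, iterations=200, alpha=0.1, beta=0.01, seed=0):
    """Collapsed Gibbs sampling for LDA on a list of tokenized documents."""
    rng = random.Random(seed)
    vocab = sorted({w for doc in docs for w in doc})
    vocab_size = len(vocab)
    doc_topic = [[0] * num_topics for _ in docs]          # tokens per topic, per document
    topic_word = [dict.fromkeys(vocab, 0) for _ in range(num_topics)]
    topic_total = [0] * num_topics                        # tokens per topic overall
    assignments = []

    # Start from a random topic assignment for every token.
    for d, doc in enumerate(docs):
        zs = []
        for w in doc:
            z = rng.randrange(num_topics)
            zs.append(z)
            doc_topic[d][z] += 1
            topic_word[z][w] += 1
            topic_total[z] += 1
        assignments.append(zs)

    for _ in range(iterations):
        for d, doc in enumerate(docs):
            for i, w in enumerate(doc):
                # Remove the token's current assignment from the counts.
                z = assignments[d][i]
                doc_topic[d][z] -= 1
                topic_word[z][w] -= 1
                topic_total[z] -= 1
                # Re-sample: favor topics common in this document
                # and topics in which this word is already common.
                weights = [
                    (doc_topic[d][k] + alpha)
                    * (topic_word[k][w] + beta)
                    / (topic_total[k] + vocab_size * beta)
                    for k in range(num_topics)
                ]
                z = rng.choices(range(num_topics), weights=weights, k=1)[0]
                assignments[d][i] = z
                doc_topic[d][z] += 1
                topic_word[z][w] += 1
                topic_total[z] += 1
    return doc_topic, topic_word

# Hypothetical baskets treated as tiny "documents" (assumption).
docs = [
    ["apple", "apple", "pear", "squash"],
    ["pear", "pear", "apple"],
    ["blueberries", "squash", "squash"],
    ["blueberries", "squash", "apple"],
]
doc_topic, topic_word = fit_lda(docs, num_topics=2)
```

The sampler returns unlabeled counts only: how many tokens of each document fall in each topic, and how often each word was assigned to each topic. Any names for the topics have to come from a human reader.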

You were able to predict that there were more apples and pears at the market than there were blueberries and tomatoes because across the whole sampling of baskets there were more apples and pears and fewer pints of blueberries. There is one significant difference, however, between the human topic model example and the algorithm. LDA does not produce names for the topics it discovers or sort words with an understanding of what words mean. Imagine that while you are sorting through baskets, you come across an Asian pear.

You make note of that, set it in either the apple or pear group temporarily, knowing that you will come back to it after you have gathered more information, and continue to sort through baskets. Over the remaining baskets, Asian pears tend to appear in baskets where there are also other kinds of pears more often than in baskets where there are also apples.

As a result, you come to the conclusion that, since Asian pears frequently appear in baskets with other pears, the Asian pear in each future basket should be sorted with the pears. This method of determining how to sort Asian pears reflects the manner in which LDA assigns words to topics, according to the other words that are found in the same document. Although the algorithm cannot account for what words mean, like your method of discovery about Asian pears, LDA does a surprisingly good job of sorting words based on co-occurrence.
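The Asian pear decision can be written out as a simple co-occurrence count. The sketch below is a deliberately simplified stand-in for what LDA does, not the algorithm itself. The baskets are hypothetical, and the helper name `sort_by_cooccurrence` is invented for this illustration.

```python
def sort_by_cooccurrence(item, baskets, groups):
    """Count how often `item` shares a basket with members of each group,
    and return the group with the highest count along with all counts."""
    counts = {name: 0 for name in groups}
    for basket in baskets:
        if item not in basket:
            continue  # baskets without the item tell us nothing about it
        for name, members in groups.items():
            counts[name] += sum(1 for other in basket if other in members)
    best = max(counts, key=counts.get)
    return best, counts

# Groups already sorted with confidence (produce names from the market story).
GROUPS = {
    "pears": {"Bosc pear"},
    "apples": {"Gala apple", "Granny Smith apple"},
}

# Hypothetical baskets (assumption), one list per neighbor.
BASKETS = [
    ["Asian pear", "Bosc pear", "blueberries"],
    ["Asian pear", "Bosc pear", "butternut squash"],
    ["Asian pear", "Gala apple"],
    ["Granny Smith apple", "Gala apple"],
    ["Asian pear", "Bosc pear", "Granny Smith apple"],
]

group, counts = sort_by_cooccurrence("Asian pear", BASKETS, GROUPS)
print(group, counts)  # pears {'pears': 3, 'apples': 2}
```

Nothing in the function knows what a pear is; the assignment rests entirely on which items turn up in the same baskets.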

However, LDA sorts words into topics based on prior knowledge that there are a finite number of topics in the corpus - much the same way that you knew to look for 10 types of produce. David Blei, credited with developing LDA and probabilistic topic modeling methods, describes topic models the following way: Topic models have been developed with information engineering applications in mind.

As a statistical model, however, topic models should be able to tell us something, or help us form a hypothesis, about the data.

What can we learn about the language (and other data) based on the topic model posterior? In other words, once a collection has been created, LDA can test our assumptions about what topics are discoverable. What drew me to LDA as a tool for discovering latent patterns of language use in ekphrastic poetry was that it seemed particularly well-suited to identifying the tropes of ekphrastic discourse.

One could reasonably expect that, since the language of stillness, breathlessness, desire, and competition is commonly found in ekphrastic poetry, LDA might be able to locate ekphrastic poems within a much larger corpus - in this case poems.

This is the question that began Revising Ekphrasis, a digital topic modeling and corpus discovery project I developed that uses computational tools to explore ekphrastic and non-ekphrastic poetry.



