514 Matching Annotations
  1. Nov 2021
  2. persianlustertiles366646663.wordpress.com persianlustertiles366646663.wordpress.com
    1. I used PivotTable to summarize my data

      Is this working well for you? It would be a real gift to the other members of this class if you were to include the PivotTable tool in your final presentation. It's a digital tool, after all!!!

    2. nationwide telegraph network by British engineers and getting the exclusive concession of excavating ancient sites in Iran by French archaeologists

      Wow! Fascinating. How might you begin to draw all of this information together to make an argument about the trade in Persian Luster Tiles?

    3. Is there a distinction between the things gathered by women and men?

      Good question! How might you go about constructing an answer? Be mindful that questions of gender require a direct take on its social construction in the period in question.

    4. Figure 3 demonstrates that Americans, with 31 collectors, have the highest percentage of luster tile purchases.

      This is all great information! It is time now to answer a few "so what" questions.

    5. most significant time in the luster tile market was between 1880 and 1920,

      What are your arguments for why "most number of artifacts" implies "most significant time?"

    6. Figure 1 shows that activities of the luster tile market started in the 1870s and dramatically increased in the 1880s. It continued in the following decades until 1940. From 1940 until 1970, activities were limited to bequest, In other words, in these thirty years, collectors purchases were inherited to their legal heirs or museums. in the next five decades, objects mostly gifted to museums. Unlike the early decades when dealers sold their tiles to collectors and museums, in these decades, collectors mostly gifted or sold their tiles to museums.

      Thank you for this! Have you started to think about what the implications of these patterns might be? The "bequest" era appears to be coincident with WWII and its aftermath, for instance...

    7. The process of removing luster tiles from Iran and distributing them in the art market consists of unsorted data in museum databases, articles, catalogs and itineraries

      This sentence is unclear.

    1. I realized thinking about how the tool can advance my hypothesis instead of how the hypothesis can fit within the tool was a critical revelation that led my thinking for the third iteration.

      Yes, indeed. As I note in the syllabus...it's not that one must always start with a hypothesis, but rather it is important to know that a certain amount of expertise is necessary to start with tools. You all are gaining some of this expertise over the course of this term!

    2. killing your darlings

      You truly changed hearts and minds on presentation day with your commitment to this strategy. Thanks for being so clear about the benefits of recognizing when change is not only good but salutary!!!

    3. , it led me to be creative and strategic in organizing my hypothesis. To me, being creative and strategic means giving the audience a seamless browsing experience with mindful data

      Excellent clarity.

    1. worried I won’t do it before our time together ends.

      You have a career to keep working on this. Please do believe me that it is an important hump to surmount, though. Computers are created by humans, every.step.of.the.way. Trust them as much as you'd trust a human who has never lived a life outside a metal box for a single day...

    2. Is this a better breakdown of the text? Should I explore further analysis with this format? I’m not sure!

      These visualizaitons are NOT the end of the road for your analysis. They are pointing the way not answering your question...so what do they point to?

    3. consistently higher negative sentiments does not mean that the text itself is inherently negative

      I hope you've been paying attention to Yujin's work...you have come around to similar realizations about the relationship between "count" and "meaning." Just because a word isn't used a lot doesn't always necessarily mean it isn't a crucial component to a text.

    4. I dislike how long it took me to make the connection.

      Humans and their own expectations of themselves...it's a thing. Would it be better to never have made the connection!!??

    5. detecting topic patters among those sentiments

      I bet this is where the machine learning comes in--or even somewhat simpler statistical model like LDA (Mallet's underlying statistics).

    6. His example makes sense though; the order of words matter in “Because Apple was doing poorly, Steve Jobs…” versus Apple was doing poorly because Steve Jobs…”.

      Take that "bag of words" model! Have you read Matt's topic modelling piece yet? It's good on this.

    7. parts of speech as I understand them are rather void of emotion

      I'm no expert here, but I suspect that "love" as a verb and "love" as a noun can be an important distinction to make--to take a possibly clearer example, "hold" as a verb and "hold" as a noun.

    8. until I get the computer to agree with me

      Why would you want this?? If the computer agrees with you, and spits out what you want, why go through the bother? This is a really serious question as you note in your next sentence!!!

    9. I disagree with the computer

      What happens if you disagree with a coder? Are these two questions related in your mind (computer v. coder)? They should be...

    10. replacing “user” with “narrator” or “character,”

      This here is not a "replacement" to be taken lightly. I feel sure you know it. The Amazon reviewer who is the subject of Christine's research isn't the same sort of being as the character of Jane Eyer--on any number of very important levels.

    1. they were not quite right because I knew what I was looking for

      Again, do watch out for accusations of confirmation bias. Computer boys love to come down on humanists for this (we talk in the "Roles" paper about this...you're reading it for the Monday after break)

    2. it only does so because I know what I'm looking for.

      Yeah, but to the trained reader, it can also just "point." If we ask LDA to point out patterns in a corpus that is unfamiliar, we can still gain utility...so long as we do not feel that the computer sees something we would see if we tried long enough. No, this is a collaboration with an alien, and sometimes that's helpful to see outside our assumptions.

    3. I used the topics generated by my run of MALLET to engage my boyfriend and my sister in conversations about copyright law and was able to teach them about the law in ways that I was never able to communicate well prior

      Amazing.

    4. lack of civic understanding

      Is it that they didn't understand or they were hopeful that they'd make big bucks if their copyright was approved no matter what?

    5. I refer you to the letter

      Here, and above...why not take advantage of hypertext to offer a link rather than asking us to scroll down? This is, after all, not a book or a scroll in actual fact :)

    6. useful objects are not

      By voice, you've told me that you know what this actually implies, but the way you've written this here implies that you're naive to the fact that LDA is gesturing towards the fact that what is NOT copyrightabke has been mentioned as much as what IS copyrightable.

    7. At this stage in my work, I do not consider the topics generated by MALLET to be the evidence that will either support or refute my thesis

      OF COURSE NOT. This would imply that the computer is giving you the answer when that isn't what is going on.

    8. It depends largely on how I define the many vague terms that comprise my thesis and it depends on how one defines the results of my work

      Yes, but such is the nature of all interpretive work. It "depends" on what YOU think. So, do be clear about that. We can accept or reject your premises, but if we don't know what they are, we can't engage at all.

    9. In

      OK, I'd like you to know that right here, I'm left totally hanging about those topics. I don't understand them at all in the way that you do, and you've moved on to another idea!! It's frustrating, frankly.

    10. Looking at these, I felt much more confident that my dataset had been improved to the point that the topic models generated did in fact suggest some meaningful insights.

      Why? How? Think of this work like textual explication...the readers should not be expected to see what you see, indeed, we're here wondering what you (the expert) see.

    11. by some Python scripts that she provided to me

      That do what? That are designed how? In a published paper, there would almost certainly be a note (or a GitHub reference) that suggests what techniques these "some" Python scripts implemented.

    12. dominated by technical struggles

      Par for the course...and will be as you learn each new tool and technique. Difference might be that you later just take this as a given.

    1. . In order to compare Voyant and Docuscope and each of their technological affordances and restrictions, I need to study Docuscope more in depth while practicing the use of it,

      Nice. Yes. This is a good plan...but maybe do a little more reading of the passages Voyant pointed out to you as well.

    2. one reference can be very significant evidence

      Yes! So, now the computer has "found them all" for you. Go and look. See what the computer has pointed out. Make meaning from its help.

    3. number of reference is important to examine my thesis

      Might I suggest that the "number of time someone references sound" is a pointer to something interesting rather than proof of something interesting. What you do with that pointer is the key element to your research!

    4. I found that accounts of wonder that is related to the authors’ experiences of sound are not prevalent in their texts, although they exist.

      So, a finding! Perhaps a disappointing one...but a finding all the same.

    1. I might be missing.

      At this point, I'd be very interested to know--how would you, Mehul, know when you were seeing something in a computer output that you are "missing" through your own reading of these texts. I don't mean this as in "it is impossible for a computer to show you something you missed," I mean this quite literally. How do you recognize something you didn't know? And where does that recognition happen?

    2. In making all of these drawings illustrating the relationship between characters, how their roles are described in establishing traditional and customary knowledge, i was able to uncover one layer of performative orality: the reference to older and customary forms of knowledge that indigenous communities deem important enough to be told and retold on some of their special occasions such as marriages. For example, this will establish which trees were considered sacred and hence, must be planted or taken care of in such relationships.

      Are you satisfied with this finding? How might you integrate it into a larger study?

    1. Documenting decisions that I made while entering data into the tool or adjusting filters in Voyant, I continued to think how I can do my computing or using the tool more mindfully

      It would be fantastic if this information were skillfully weaved into your argument without just plopping in your whole method. If you'd like to talk more about how best to do this, do not hesitate to make an appointment to chat!

    2. do not prevalent

      "are not prevalent." But here again, you are tacitly rather than explicitly relying on the number of references to imply importance. Is that your mindful choice?

    3. studies of acoustics by using music- and sound-making objects.

      Ah, so the context found in the work matters...how do you plan to bring this into your study? I think it's important and will enrich your findings.

    4. as the graphs show

      Where? How? Channel your inner art historian and offer up some formal analysis of these graphs for us so that we can be sure we see what you see.

    5. mostly

      OK. So if the experiences are not "mostly" about sound, does that mean sound takes a backseat to sight? That argument surely might hold water, but is it what you mean? What valence are you ascribing to a "count?"

    6. The results show

      My advice for the next iteration: consider much more deeply how much you are expecting your reader to "read your results off the graphs," rather than you telling us how you have found what you have found. You don't want to leave it this way, as I may have a completely different interpretation of the figures!

    7. there are many passages which depict the authors’ feelings/imaginations of wonder

      Nope...the passages have the words you've identified. It is really important to ensure that you are not overstating your claim.

    8. graphs

      of what? how? Perhaps yesterday's conversation about how visualizations only do what the creator wishes for them to do--and sometimes don't even help make sense--can create a better understanding of how to contextualize "graphs ("Trends")." What type of graph? What does "Trends" do?

    9. Running the Voyant

      I can tell you're going to get into this, but the rhetoric here is that "running Voyant" told you something. But Voyant is a series of discrete tools. Which ones did you choose and why? Instrumentalizing the toolset as a type of deus ex machina isn't the strongest way to start providing evidence for your hypothesis, if you see what I mean?

    10. the asterisk* mark after “wonder,” for example, indicates that I am searching certain terms that start with “wonder” as one term

      There's a technical term for this...and it's probably worth using alongside this great definition. It is stemming.

  3. Oct 2021
    1. While I’m not sure what to do with this capta going forward, I feel the issues lie with my lack of sophisticated analytical skills more so than with the capta itself.

      What scholarly texts have you been reading that might help you out with models? I feel quite strongly that you've not yet engaged with the words in these texts--you appear to still be thinking of the texts as meaningful...so the words must be, but that connection is not as clear as you seem to assume. Why would "shared words" imply "shared meaning?" People do not really parrot one another, and also the context of a word is almost as important as the word itself.

      What is your model of the text that allows the Voyant tools to help you make the meaning you want to make? How does comparison enter this picture?

      Keep going for sure--I might even suggest you make your next iteration a report on what "sentiment analysis" really and truly is-- but you seem to be wandering around still, thinking perhaps that the computer will give you an answer. You will need to do a lot more pre-processing for the computer to help you find the answers you might seek.

    2. Sentiment analysis

      What is sentiment analysis? What parts of it are you adopting and what parts of it are you not?

      I can tell you that it is not "word counting" which is what you do before, so I think you have a lot more work to do here connecting this theoretical model of the text with your approach thus far. You can do it!

    3. I no longer feel that way, perhaps because I no longer feel I could have shaped this work into an academic paper, as I did when focusing on religion in Jane. With that, I felt I was grasping at straws to turn a literature scholarship into a DH project, and I don’t think that’s the case with my current capta set.

      I'd like to hear more about this, both in terms of the assumptions you were bringing in from your past experiences/expectations of publication, and how working with these tools has changed your mind.

    4. So for now, here we are: “Cottage” Cirrus, TermsBerry, and Collocates; “The Search after Happiness” Cirrus, TermsBerry, and Collocates; and Jane Eyre Cirrus, TermsBerry, and Collocates.

      These are visualizations of the whole texts? What pre-processing did you do?

    5. While it makes sense to incorporate Wuthering Heights into this project, I won’t

      Please don't spend any more time now thinking about this! Focus more on really getting to know the tools, using Jane Eyre as something like a "test case."

    6. both recommended expanding beyond the one book

      I still recommend this. Once you get your head around how some of these tools work and what they can and cannot do, expanding your capta to include other texts will be very important.

    7. I determined that Voyant is likely too basic of an interface for the level of sophistication I’m hoping to reach this semester

      But it is a good place to start, and you definitely still have things to learn about it! I would really recommend that you focus on getting to know all that it can do.

  4. persianlustertiles366646663.wordpress.com persianlustertiles366646663.wordpress.com
    1. questions

      All of this is great! I recommend that you start moving from "tinkering" to "argument generation" more seriously during the next iteration. The capta has shown you patterns, now it is up to you to tell us what those patterns mean in context.

    2. visualizing relationships between museums and buildings in Iran revealed that there are some other institutions that have objects from various buildings, but they were overlooked in our study due to a lack of information about their objects.

      This could use further clarification and explication, and I'm looking forward to hearing about it over the next round of iterations!

    3. In addition, matching dates of receive with dated of International Exhibitions, highlighted the role of these global fairs in distributing luster tiles among collectors and museums.

      I think that there could be summative visualizations that help you demonstrate this quite directly--I recommend that you try them during this next iteration!

    4. donations

      I think that I could use more work distinguishing "sales" from "donations." Are all "donations," donations from collectors that bought the tiles via "sales?"

    5. particularly during a four-year period between 1907 and 1910

      ...and that's surprising. Or, ...and that makes sense because [BLANK]. I would advise you, especially in the web context, not to build up un-interpreted findings awaiting a large-scale conclusion. Say what the evidence means as you go.

    6. Except Smith who bought objects for the Victoria and Albert museum, others were collectors.

      You mean that Smith was an agent for the V&A and the others collected for themselves? It might be worth considering in what ways personal collectors saw their habits as being personal and in what ways they saw them as being "public" or "for the public good." Shirin Fozi has experience thinking through these issues!

    7. have tiles ranging from three to four buildings, despite the fact that they were overlooked in our network’s centrality due to a lack of information.

      I don't fully understand this. Please say more and write out/present both your evidence and your interpretation more fully.

    8. highlighted recipients

      Highlighted how? The darker circles? When it comes to explicating figures, it is critical to consider it to be your job to perform a type of formal analysis. Please don't assume that your readers see what you see. You want to demonstrate what you have done, seen, and interpreted from your evidence.

    1. clusters of words

      Given your brief lit review above, surely it will have to be that making the written word a bonafide proxy for the spoken word is, in itself, fraught with peril, is it not? The written word and the spoken word are not the same performatively or rhetorically...so how are you going to work with that? I think it's possible, but am interested in your approach.

      I wonder if for the next iteration you might not work with your own visualizations of these stories, a bit along the lines of the Knowles et al. piece from the other week, or even this week's "Sandcastles" reading.

    2. In my view, it is not the failure of tools but my own inability to develop a theory of memory in these oral texts, that have been translated into written languages on a page

      "Failure" is a very strong word. You are working iteratively to figure something out. There is no failure there until you quit without reaching a conclusion you are satisfied with.

    3. the tool did not yield groups of words related to memory use for me.

      What is the evidence you can provide me to show me what you did, how you interpreted what the computer showed you, and how it was not helpful? You say it was not helpful--and while I believe you, good interpretive research does not rely on the reader believing the scholar. Evidence is required!

    4. lengths of tribal memories and narratives can be so long that the historical time merges with the legendary

      Thrilling! Could you say a bit more about how? Is it solely due to "length?"

    5. all kinds of information, some not necessarily memory oriented or memory based

      Could you say more about the distinction you are making between "stories that contain all kinds of information" and "memory?"

    6. chipping away slowly, paragraph by paragraph, at stereotypes like inferiority of orality

      Interesting! How does she do this, and can you use the same techniques?

    7. However, several scholars have chosen to study the use of memory to store knowledge in oral cultures using mnemonic techniques

      I want to make sure I follow. Is this a restatement of your claim, "Several scholars have successfully studied mnemonic techniques as an oral skill to store knowledge?"

    8. In the case of oral languages, using tools first before developing a theory of memory technologies has not been very fruitful for this project.

      I hear you on this, but the remaining sentences in this paragraph do not directly support your claim. It talks about other things that you need to do...but not why thinking with and through tools (that may or may not work) wasn't instrumental in getting you to this point.

    1. freaking out about the work that I have been procrastinating on.

      Very interesting!!! Please don't expect change to happen over night. The process of learning and knowledge production happens iteratively, over time (even over a career). Keep up the mindful reflection! There will be highs and lows.

    2. I’ve incorporated my offline version of coding the reviews into sections based on my analysis into the WordPress page to demonstrate my methodology and how I categorized each sections

      ??

    3. extra words that shouldn’t count towards analysis

      What are these words and how do you know they "shouldn't count?" See my comment above about "mask(s)" in your word cloud.

    4. it’s becoming more obvious that people in the review section are treating this space as another way to solidify and support their own existing opinions

      Great! How? Why? On what evidence and using what interpretive methods?

    5. A different question of inquiry could look into audience studies that use self-reported survey data that asks reviewers what types of offline activities they have engaged in related to issues of voting, politics, mask practices, etc.

      Would you need to associate this data with particular Amazon usernames?

    6. The review space functions as a type of forum but one where people are not discussing nor debating the issues of health, science, and politics with each other. Instead, Amazon consumers are declaring their own opinions and views to the rest of the consumers and reviewers on that particular product page.

      This makes intuitive sense to me, but you haven't provided me any evidence for why you have come to this conclusion.

    7. I want to pay close attention to the particular words

      OK, great. Are these words you've identified in advance or words you are willing to use digital tools to discover? It's unclear what your order of operations has been--or what you would ideally like it to be.

    8. I had initially hoped to let the capta speak for itself but unfortunately, I, the human, will have the responsibility of working through the capta and interpreting it.

      Wait what whoa?? Really? "Unfortunately?"

    9. non-traditional public spheres

      Again, are digital public spheres "non-traditional" in 2021. Or do you just wish to make a strong distinction between meatspace communication and digitally mediated communication?

    10. commercially owned communicative spheres

      Are there non-commercially-owned digital communication spheres? If so, which? In either event, what is the distinction you might make here between "digital" and "commercial?"

    11. significant

      In what ways does "reoccurring" imply "significance?" Do be precise here. "The" is a frequently recoccuring word and yet its significance varies greatly depending on the researcher's context, no?

    12. What is the review exporter extension’s socio-technical context

      Great question! Who, besides you, wishes to scrape out "all" the Amazon reviews from a product page?

    13. Donald

      Overall, for the next iteration, I'd like you to continue your analysis of these reviews (obviously), but also pay a bit more mindful attention to the ways in which your medium of expression affects your message!

    14. Digital Tools

      This section is very hard to read. I am not sure how to proceed with reading. As with graphic novels, there are ways to ensure a clarity of logical flow without needing there to be successive traditional paragraphs. You might want to look into those habits and techniques!

    15. Reviews in this category are also likely to mention that these masks were made in China,

      How do you contextualize this in the boundaries of the debate you are framing?

    16. dual themes of appropriate protection and inadequate protection provided by the face mask.

      This is intriguing. What do you mean precisely? I think YOU know, but your words here leave me to fill in a huge set of knowledge gaps.

    17. language that is against the political opposition

      This is unclear to me here. In retrospect I think you mean, "language that clearly situates the reviewer in political opposition to those who would not vote for DJT." Is that right? If not, do clarify, and if so, consider clarifying your words-to-meaning connection.

    18. I have identified the theme of anti-science rhetoric through words

      How? By reading them with your eyes? Whatever the method is fine, but do be specific (here and below).

    19. The reviews of the Make America Great Again masks suggest that wearing or buying a mask is understood by the reviewers as an expression of their own politics and understanding of health, as well as in relationship to the perceived politics of those they see as their opposition

      Indeed. This seems to be a reflection of the dominant narrative about these masks. I wonder--in what way do you see your own work contributing specifically to the better understanding of WHY this is true? This belongs in the thesis as well.

    20. particularly interesting in the discussion about health and political commenta

      Why???? I suspect you'll get into this down below, but why the cliffhanger? Is a sense of suspense something you're trying to create? If so, to what end?

    21. Why This Product?

      I'm truly confused here about where this comes from. Is this from Amazon or you! I only really suspect it is you because of the small size of the words to the right!

    22. Amazon?

      These words are about the same size and font as your image of the Amazon page to the right creating visual confusion about what I should pay attention to. Your actual words are tiny in comparison to the words originally found on Amazon's site, so on first glance, I'm inclined to ignore you and focus on these other more prominent, and colorful images/texts. Is this what you are trying to do?

    23. that create these online spaces of public discourse.

      I might argue that this is now "traditional," and therefore unsuprising. Can you create a frame for this argument that convinces me otherwise?

    24. Trump Face Mask Amazon Product

      I find this band of image/text very difficult to read and also I find it difficult to understand my best engagement with this part of your work, FWIW.

    25. Voyant Cirrus Visualization of Captase

      At first glance, including the term "mask(s)" skews this word cloud in favor of the mundane. Is that your goal? Also, just as a point of reference, Word Clouds are notoriously compromised visualizations for general purpose use, so I would be interested to hear more about your interpretation of what this shows!! Here are a few references about the complexities of this viz:

      https://www.alida.com/the-alida-journal/pros-and-cons-word-clouds-visualizations

      https://dhs.stanford.edu/algorithmic-literacy/using-word-clouds-for-topic-modeling-results/

    1. For the upcoming iterations, I will resume experimenting with the digital embodiment of DICTEE as an artist’s book with a possibility of presenting other modalities such as video and audio, inspired by Alison's proposition. 

      Great! You do this! Very much looking forward to your deepening engagement with this process.

    2. Jamaica also helpfully suggested that perhaps there is some way that in its constraints, it could be somehow informative, which I’m increasingly finding it true as I’ve been getting hints from the Dublin Core elements about the question of authorship and editions.

      Wonderful!

    3. This was also accountable to my very limited knowledge of Omeka, and its format was—and still is—dominating the process

      I think your redesign is really good for the purposes of this course. Congrats on your ongoing engagement with Omeka.

    4. extremely useful for a researcher like me

      For the purposes of this class, a greater recognition of the pros and the specific cons of this access would be great to hear.

    5. ince Cha’s experiments largely concern with the layout of the literary text and the language play, the visual presentation does emphasize the materiality of the text. The letters seep into the space, in this case the space of a digital image, created through the use of visual elements that blur the boundary between body and text

      Right here, Sojin, I desperately want to see what you mean with inserted figures--or at least hyperlinks to examples that I can understand. You are your own publisher here, yes! So, the burden of conveying your meaning is not only word-based as it might be with a typewriter or a handwritten paper. You have an entire digital computer with connections to all of these images. Please use them to help you convey the depth and importance of your research in the most effective ways possible!!

    6. actual artist’s books

      The distinction between an "actual" and a "proposed" artist's book means what? Is this not an essential part of the work, but is instead only in the eye of the beholder?

    7. I would be successful if I could discover ever-so-small moments of the continuities between the print and digital embodiments.

      Ooh. Cool. Why would this help you though? I believe it will...but again, don't gloss over the process. Be tediously precise, if you can.

    8. And I see this in a very calculated arrangement of the pages and the layout of the text from front to back, which almost feels controlling at times

      Good. Yujin has spurred you to get much closer to the point of performativity...so do the same for why/how language does this and how that then supports your definition of an artist's book.

    9. embraces performance in the form of text

      By what precise mechanism? Mainly the "breaking of the fourth wall" in the exercises? Autobiography and historical narrative are pretty traditional! I think you're onto something here, but haven't quite yet figured out what it is exactly. Keep going!

    10. language operates at the core of this experiment

      You say why in the next sentence, but here...it seems like a big assumption/thesis/hypothesis! Consider a new ending to this sentence, "...language operates at the core of the experiment because [a few words from you].

    11. I’m assuming that Theresa Cha is working this way.

      Well, is it not that this is a hypothesis, and you are looking for evidence? Otherwise put, what's the difference in your mind between an assumption and a thesis?

    12. what I mean by artist’s book is an object in a book form that the creator takes very intentional choices in the making, which is mediated and reinforced by physical engagement

      I'll be interested to hear if this definition remains the same after your conversation with Kate. It's going to be a tricky genre to define, I feel, and so it will be very important for you to fully believe in your definition.

    1. I don't yet understand those documents well enough to describe them on my own

      Feel free to set up a time to talk this through. I'm happy to help. You aren't solely sent to your home alone to fight through this! You're sent to your home alone to fight until you hit a wall, then you show me the fight and the wall, and I help (so far as I can) over the wall, and the whole thing starts again!

    2. However, the process is "expensive" both in terms of time and computer bandwidth.

      I hope this also is a learning experience for you in a school of computing and information--there really are still computationally-intensive things that happen. The "computer boys" have access to machines that would make Adobe run at the speed of light, but that means little to you in your home...

    3. Ms. Terry Kapral was also generous enough to share with me a short Python script that I am hoping to have the skill to leverage in converting the entire cache Review Board decision letters from pdf to .txt.

      Wonderful.

    4. . The reason for this is that my corpus only contains 48 documents, which is too small a collection (of documents that themselves are rather short) to generate statistically insightful results.

      Yes, but it can point the way.

    5. t was my presumption that the common words would "throw" the statistics.

      So here you think the commonality will skew results, when above, the product names (and their specificty) will??