Hypothesis

10 Matching Annotations

Nov 2018
www.futurelearn.com

Part 4: frequency data, concordances and collocation - Corpus Linguistics: Method, Analysis, Interpretation - Lancaster University

2
1. Mr.Green 05 Nov 2018
  
  in Public
  
  The formula for calculating normalised frequency is: (observed frequency of your search term × basis of normalisation, e.g. 1 million) / total corpus size.
  
  Formula for normalized frequency
  
  CPL
2. Mr.Green 05 Nov 2018
  
  in Public
  
  So here we learn that all linguistic features are related to culture.
  
  CPL
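
The normalised-frequency formula quoted in annotation 1 above can be sketched in Python. This is a minimal illustration, not code from the course: the function name `normalised_frequency`, its parameter names, and the counts in the usage line are all hypothetical.

```python
def normalised_frequency(observed: int, corpus_size: int, basis: int = 1_000_000) -> float:
    """Return the frequency of a search term per `basis` words.

    Implements (observed frequency x basis of normalisation) / total corpus size.
    All names here are illustrative assumptions, not part of any library.
    """
    if corpus_size <= 0:
        raise ValueError("corpus_size must be a positive number of words")
    return observed * basis / corpus_size


# Hypothetical counts: 250 hits in a 5-million-word corpus.
print(normalised_frequency(250, 5_000_000))  # 50.0 occurrences per million words
```

Normalising to a common basis is what makes counts from corpora of different sizes comparable; the basis of 1 million is only the default suggested in the quoted formula and can be changed.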

Tags

CPL

Annotators

Mr.Green

URL

futurelearn.com/courses/corpus-linguistics/6/steps/370723
www.futurelearn.com

Part 2: annotation and mark-up - Corpus Linguistics: Method, Analysis, Interpretation - Lancaster University

1
1. Mr.Green 04 Nov 2018
  
  in Public
  
  Annotation can be defined as “The process of adding […] interpretive, linguistic information to an electronic corpus of spoken and/or written language data” (Leech 1997). So annotation involves adding interpretive linguistic information to a text (e.g. part-of-speech). Markup provides non-linguistic, objective, verifiable information (e.g. author, paragraph boundary). Tagging is the process of applying specific conventions (tags) to a text for annotation or markup purposes (e.g. XML tags). The key distinction between annotation and markup is the type of information they add to a text. Tagging is then a method of annotation or markup.
  
  So all three are now identified and unambiguous: annotation is interpretive, markup is metadata, and tags are conventions.
  
  CPL
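
The three-way distinction in the note above can be illustrated with a small XML sample. This is a minimal sketch under a made-up tagging scheme: the element names `text`, `p` and `w`, the `author` and `pos` attributes, the part-of-speech labels, and the sample sentence are all hypothetical and do not come from the course or from any standard tagset.

```python
import xml.etree.ElementTree as ET

# XML tags are the convention (tagging) used to carry both kinds of information.
SAMPLE = """<text author="Jane Doe">
  <p>
    <w pos="DET">The</w>
    <w pos="NOUN">corpus</w>
    <w pos="VERB">grows</w>
  </p>
</text>"""


def read_markup(root: ET.Element) -> dict:
    """Markup: objective, verifiable facts (author, paragraph boundaries)."""
    return {"author": root.get("author"), "paragraphs": len(root.findall("p"))}


def read_annotation(root: ET.Element) -> list:
    """Annotation: interpretive linguistic information (part of speech per word)."""
    return [(w.text, w.get("pos")) for w in root.iter("w")]


root = ET.fromstring(SAMPLE)
print(read_markup(root))
print(read_annotation(root))
```

Both kinds of information travel in the same tags here, which is why tagging is better seen as the method than as a third type of information.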

Tags

CPL

Annotators

Mr.Green

URL

futurelearn.com/courses/corpus-linguistics/6/steps/370721/comments
www.futurelearn.com

Part 3: types of corpora - Corpus Linguistics: Method, Analysis, Interpretation - Lancaster University

4
1. Mr.Green 04 Nov 2018
  
  in Public
  
  (https://corpus.byu.edu/coca/). You can access the BNC free of charge through CQPweb (https://cqpweb.lancs.ac.uk/).
  
  corpus English
  
  cpl
2. Mr.Green 04 Nov 2018
  
  in Public
  
  (https://www.birmingham.ac.uk/schools/edacs/departments/englishlanguage/research/projects/clic/index.aspx)
  
  Now comes my thing: a literary corpus for linguistics.
  
  CPL
3. Mr.Green 04 Nov 2018
  
  in Public
  
  There's the BAWE corpus (about 6 million words) and the Cambridge Academic English corpus (CAEC) (almost 4 million words). I am not sure whether those corpora are small enough for your research purposes. You can have a look at the list of academic corpora available through Sketch Engine (https://www.sketchengine.eu/user-guide/user-manual/corpora/corpora-list/).
  
  corpus
  
  CPL
4. Mr.Green 04 Nov 2018
  
  in Public
  
  If I am comparing a corpus of the work of Robert Burns with a general corpus of Late Modern English, does it make sense to remove the Burns texts from the general corpus?
  
  Extracting a specialized corpus from a general corpus and analyzing it against that general corpus.
  
  CPL

Tags

CPL

cpl

Annotators

Mr.Green

URL

futurelearn.com/courses/corpus-linguistics/6/steps/370722
www.futurelearn.com

Part 2: annotation and mark-up - Corpus Linguistics: Method, Analysis, Interpretation - Lancaster University

3
1. Mr.Green 04 Nov 2018
  
  in Public
  
  Markup is the way of adding annotation to the text. Tags are a part of the annotation itself. If you are applying a standard tagset (e.g. POS tags, USAS tags)
  
  Tags are part of markup and are used to annotate the text or corpus.
  
  CPL
2. Mr.Green 04 Nov 2018
  
  in Public
  
  Corpora are collections of natural language that are stored in electronic format. They provide evidence to help us identify patterns, trends and changes in language use that we might otherwise not be able to identify.
  
  A wonderful use of corpora: to develop linguistic theory and to test and validate hypotheses. For the last two, all we need is data, and a corpus means data (an enormous amount).
  
  CPL
3. Mr.Green 04 Nov 2018
  
  in Public
  
  I would argue that the difference between annotation (tagging) and markup is that markup involves adding information (either linguistic or non-linguistic) to a text, whilst linguistic annotation explicitly involves adding linguistic information (and in this sense, might be viewed as a sub-category of markup as an umbrella term).
  
  So, here's the trouble: markup and annotation are two different things.
  
  CPL

Tags

CPL

Annotators

Mr.Green

URL

futurelearn.com/courses/corpus-linguistics/6/steps/370721
