Course:COGS200/2017W1/NGramAssignment/JovanaDrasko

From UBC Wiki

All terms are in between the years of 1800 - 2000 and smoothness of 3 and corpus English.

Compare Words

Compare Words Graph - Google Ngram Viewer

a) Code: Funny, witty, humorous

b) The graph is all over the place, there is no straight line for each word. There are many peaks and a few deep drops, especially for the word funny and witty between the years 1898 – 1991. For the word funny there is a high stability state shown on the graph between the years 1944- 1991. Otherwise it’s appearance has been increasing since 1964. For the word witty, there you can see multi-stable states between the years 1800- 1832. And one high stability state between the years 1908 - 1926. The word humorous has a small low and high Stability State between the years 1800 – 1844. And then an increase from 1844 to about 1898 and then again between 1917- 1924. Then it decreased from then till 2000 and then seemed to stabilize.

c) There is nothing specifically driving the comparison.

d) Of all the word funny has appeared in the corpus more between 1912 - 2000 than the other words. From all the unigrams contained in this graph the highest percentage is for the word funny. There are many different changes that could affect these words over time.

Wildcard Search

Wildcard Search Graph - Google Ngram Viewer

a) Code: University of *

b) These collective variables seen in the graph show the top ten substitutions in place of a word for the phrase University of *. The graph shows a few low stable states for the University of California, Chicago, Michigan, and Texas. Also the graph shows between the years 1800 to 2000. There is only one high stable state for the substitution of Chicago for University of Chicago between the years 1979 – 1996.

c) In the original wildcard query there is an increase from the year 1800-2000 from the corpus English. There is nothing unexpected driving the effect.

d) For the substitution of Cambridge and Oxford there is a drop between the years of 1900 to about 1940 and then rising from 1941 to 2000. The most popular word following University of is California. There are many different changes that could affect these words over time.

Inflection Search

Inflection Search Graph - Google Ngram Viewer

a) Code: choose_INF a name

b) The verb choose is mostly used in its general form. Collective variables of the word are choosing, chose, chosen, chooses. The modifications are in the tense of the verb. The attractors shown as the verb tense increase or decrease and settle to a general increase only after the year of 1980. Also the graph shows the years between 1800 – 2000. There are many high and low stability states throughout the graph. For example for the basic form choose there a few peaks between the years 1808 and 1873 and 1914 just to name a few. But there is a high stability state between 1958 – 1971. And from 1971 the graph shows at increase for the phrase “choose a name.”

c) In the graph showing all of the forms sum there are no multi-stable systems. After the phrase “choose a name” all other variations, tenses, there is a lower stream of states for the phrase especially between the years of 1965 to 2000.

d) For example, the topics of History, Religion in the years of 1800 – 1901 are shown for all tenses of the word choose. The different tenses how they’ve increased in appearance in the corpus English in this graph show a kind of cultural change, what tense was used most requently.

Search for a word using Part-of-Speech tags

Part-Of-Speech Graph - Google Ngram Viewer

a) Code: jump_VERB_INF, jump_NOUN_INF

b) The term jump in the parts of speech for noun and verb are shown. Also with the different inflictions of the word are shown too. For the noun jump and it’s different tenses shown the graph has a few high and low stability states. Every example of the term jump in it’s different tenses as a verb or noun all are increases after the year 1800. Some are evening out between 1800 – 1840 others are increasing from 1820. From all of the infliction forms sum of the word jump the infliction with the verb shows an increase and one high stability state between 1954 and 1981.

c) There are a few interactions of the different tenses and part-of-speech in the graph such as with jump_NOUN and jumping_VERB in around the year 1917. And for tense and part-of- speech for jump_VERB and jump_NOUN in around the year 1959 and 1987.

d) Through the years the word jump has been used in different tenses and usage (in description, as a verb or as a noun).

Search for Parts of Speech (not a specific word)

No Specific Word - Parts-Of-Speech - Google Ngram Viewer

a) Code: *_ADJ

b) The graph shows how the phrase has occurred over a corpus English of books. It shows a decrease from 1800 to 2000. It shows one small peak in 1952. The adjective part-of-speech was shown a lot between the years 1800 – 1804.

c) If you change the smoothness of the term on the graph from the corpus English the graph is still showing a decrease.

d) This graph is analyzing the corpus English so the term adjective could be used differently in different corpuses. For example from the corpus Russian, from the year 1925-2000 the behaviour of the graph seems to suddenly decrease for the usage of an adjective in speech.