Linguistics 460: Textual Data Analysis with R
UNC-Chapel Hill Linguistics
Fall 2024
Elliott Moreton

2024.05.03.F, 4-7 p.m.

FINAL PROJECT PRESENTATIONS

2024.04.29.M

Topics: MIDTERM 2

2024.04.24.W

Topics: Midterm 2 review

Class:

Assignments:

2024.04.22.M

Topics: Project clinic in class.

Class:

Assignments:

2024.04.17.W

Topics: LASSO regression for document classification.

Class:

Assignments:

2024.04.15.M

Topics: Evaluating classifier performance. Changing parameters to optimize performance

Class:

Assignments:

2024.04.10.W

Topics: Naive Bayes classifiers for document classification.

Class:

Assignments:

2024.04.08.M

Topics: Document classification using supervised learning. Document-term matrices.

Class:

Assignments:

2024.04.03.W

Topics: Sentiment analysis.

Class:

Assignments:

2024.04.01.M

Topics: Sentiment analysis.

Class:

Assignments:

2024.03.27.W

Topics: "Tidying" text.

Class:

Assignments:

2024.03.25.M

Topics: "Tidying" text

Class:

Assignments:

2024.03.20.W

Topics: Regular expressions.

Class:

Assignments:

2024.03.18.M

Topics: Final projects. Regular expressions.

Class:

Assignment:

2024.03.06.W

MIDTERM 1

Assignment for 3/18 (M): Read Chapters 1--3 of Handling and Processing Strings in R, by Gaston Sanchez.

2024.03.04.M

Topics: Midterm review

Class:

2024.02.28.W

Topics: Linear regression

Class:

Announcenment: Midterm on Wednesday, March 6. A midterm syllabus will be unveiled on Canvas at the end of class.

2024.02.26.M

Topics: Tests of association between categorical variables

Class:

Assignment:

Announcement: Office hours on 2/27 Tues. will be later than usual (4:15--5:15 instead of 2-3) and by Zoom (link is on syllabus).

2024.02.21.W

Topics: Testing hypotheses about a population mean.

Class:

Assignment for 2/26 M:

2024.02.19.M

Topics: Null-hypothesis significance testing

Class:

Assignment:

2024.02.14.W

Topics: Confidence intervals

Class:

Assignment:

2024.02.07.W

Topics: Sampling theory.

Class:

Assignment for 2/14 W:

Note: If the illustrations are not showing up for you in the Web-based version of Navarro 2017, a pdf can be found here.

2024.02.05.M

Topics: Frequentist vs. Bayesian statistics. Probability distributions. Samples.

Class:

Assignment: Read Navarro 2017, Ch. 10 through the end of 10.4. There is no associated quiz.

Note: If the illustrations are not showing up for you in the Web-based version of Navarro 2017, a pdf can be found here.

2024.01.31.W

Topics: Descriptive statistics: Variability

Class:

Assignment for 2/5 M:

2024.01.29.M

Topics: Descriptive statistics

Class:

Assignment: Reading, from Navarro 2017: 5.1, 5.2, 5.4, 5.5, 5.7--5.10. Also: Quiz 2 (on Canvas).

2024.01.24.W

Topics: Data frames and basic visualization.

Class:

Assignment: Lab 1

2024.01.22.M

Topics: Data frames. Start visualization.

Class:

Announcement: Office hours are T 2-3 and F 2:30-3:30

2024.01.17.W

Topics: R data types

Class:

Assignment for 1/22 M:

2024.01.10.W

Topics: Course organization. R and RStudio.

Class:

Assignment for 1/17 W: