Universität Wien
Warning! The directory is not yet complete and will be amended until the beginning of the term.

200144 SE Seminar in Applied Psychology: Mind and Brain (2024W)

Introduction to Scientific Computing in Social Sciences

4.00 ECTS (2.00 SWS), SPL 20 - Psychologie
Continuous assessment of course work

Anwendungsseminare können nur fürs Pflichtmodul B verwendet werden! Eine Verwendung fürs Modul A4 Freie Fächer ist nicht möglich.

Registration/Deregistration

Note: The time of your registration within the registration period has no effect on the allocation of places (no first come, first served).

Details

max. 20 participants
Language: English

Lecturers

Classes (iCal) - next class is marked with N

  • Tuesday 08.10. 15:00 - 16:30 PCR Computerhörsaal Psychologie, NIG 6.Stock A0607
  • Tuesday 15.10. 15:00 - 16:30 PCR Computerhörsaal Psychologie, NIG 6.Stock A0607
  • Tuesday 22.10. 15:00 - 16:30 PCR Computerhörsaal Psychologie, NIG 6.Stock A0607
  • Tuesday 29.10. 15:00 - 16:30 PCR Computerhörsaal Psychologie, NIG 6.Stock A0607
  • Tuesday 05.11. 15:00 - 16:30 PCR Computerhörsaal Psychologie, NIG 6.Stock A0607
  • Tuesday 12.11. 15:00 - 16:30 PCR Computerhörsaal Psychologie, NIG 6.Stock A0607
  • Tuesday 19.11. 15:00 - 16:30 PCR Computerhörsaal Psychologie, NIG 6.Stock A0607
  • Tuesday 26.11. 15:00 - 16:30 PCR Computerhörsaal Psychologie, NIG 6.Stock A0607
  • Tuesday 03.12. 15:00 - 16:30 PCR Computerhörsaal Psychologie, NIG 6.Stock A0607
  • Tuesday 10.12. 15:00 - 16:30 PCR Computerhörsaal Psychologie, NIG 6.Stock A0607
  • Tuesday 17.12. 15:00 - 16:30 PCR Computerhörsaal Psychologie, NIG 6.Stock A0607
  • Tuesday 07.01. 15:00 - 16:30 PCR Computerhörsaal Psychologie, NIG 6.Stock A0607
  • Tuesday 14.01. 15:00 - 16:30 PCR Computerhörsaal Psychologie, NIG 6.Stock A0607
  • Tuesday 28.01. 15:00 - 16:30 PCR Computerhörsaal Psychologie, NIG 6.Stock A0607

Information

Aims, contents and method of the course

In this course students will learn how to use natural language processing tools to automatically analyze the content of large numbers of texts. After this course students will be able to perform sentiment analyses of modern textual sources from the internet but also of historical sources such as fiction work (movies and books) from the past. Students will be able to quantify the temporal change of values and preferences expressed in these textual sources and its relationship to historical events and socio-economic dynamics. BASIC KNOWLEDGE OF PYTHON AND R are highly recommended.

Contents

Week 1 - Introduction to digital humanities. Brief introduction to computational social sciences; description of the general set of tools and datasets.

Week 2 - Bag of words approach. Refresher of Python syntax and objects; How to create a bag of words using dictionary tools like WordNet.

Week 3 - Preprocessing in NLP and Frequency analysis. Why preprocessing, general steps; How to extract the most frequent words after preprocessing; How to extract the most frequent words in a text;

Week 4 - Introduction to datasets with Pandas. Iterate over cells and texts; Generate clouds of words using WordCloud and average frequency plots.

Week 5 - Sentiment analysis. Overview of techniques used to implement sentiment analysis; Implementation examples.

Week 6 - Introduction to Web scrapping; How to read and extract information from a website; Using BeautifulSoup to read HTML and XML files.

Week 7 - Iteratively collect information from a website using loops; Implementation of the sentiment analysis pipeline to online information.

Week 8 - Webscrapping using API. What are APIs. How to use APIs to extract information from the internet (e.g. IMDB, Twitter); Implementation of the sentiment analysis pipeline to API information.

Week 9 - Culture as fossilized psychology. The challenge of analyzing historical texts. How to use psychometric tools to build valid bags-of-words; plot historical sentiment time series using Seaborn.

Week 10 - Handling Socioeconomical data. Surveying the major datasets for contemporary and historical socioeconomic estimates; Understanding the meaning and differences between socioeconomic variables and how they are affected by historical events; Extracting socio-economic data from datasets and adding them to a dataset of textual word frequencies.

Week 11 - Testing hypotheses on the relationship between historic events, socioeconomics, and word-frequencies. Quasi-experimental methods, time series trends, linear mixed models; Linear mixed model trend comparison, model selection with stepAIC; Implementation with R.

Week 12 - Project presentations and Q&A

Assessment and permitted materials

Class homework assignments, including 3 small assignments (3 x 20%) and a final project (40%) (Individual)

Minimum requirements and assessment criteria

Students must at least demonstrate conceptual knowledge and understanding of the tools and processes used in computational social sciences. They must be able to conceptualize a project using these tools and present this conceptualization in an oral presentation. Grades improve if students are also able to implement these projects using provided (or custom) scripts in Python and R programming languages, or other computational social sciences tools.

Examination topics

Learning Objectives:
1. Automatically scrape large numbers of texts from the internet
2. Use natural language processing tools to perform automatic text analysis, including sentiment analysis.
3. Quantify the change of values and preferences across time, its relationship with historical events and with socio-economic conditions using basic econometrics analysis tools.

Learning activities
1. Lectures: students engage in discussions about the readings and the theory provided by the professors.
2. Practical work: students will aim to design experiments individually (this will also include training students in R and Python).
3. Student Activities (Individual/group): students develop their final projects and work on their assignment.

Reading list

Harvard python course:
- https://www.youtube.com/watch?v=nLRL_NcnK-4&t=7492s

Computational social science courses:
- https://ayoubbagheri.nl/applied_tm/
- https://github.com/JanaLasser/SICSS-aachen-graz
- http://digitalmedia.andreasjungherr.de/docs/intro.html
- Engel, U., Quan-Haase, A., Liu, S., & Lyberg, L. (Eds.). (2021). Handbook of Computational Social Science, Volume 2: Data Science, Statistical Modelling, and Machine Learning Methods (1st ed.). Routledge. https://doi.org/10.4324/9781003025245

Historical Psychology:
- Martins & Baumard (2022). How to develop reliable instruments to measure the cultural evolution of preferences and feelings in history? Frontiers in Psychology (13) https://osf.io/acukm/


Association in the course directory

Last modified: Mo 09.09.2024 11:46