University of Warsaw - Central Authentication System
Strona główna

(in Polish) Computational Social Science: An Introduction to Data Science

General data

Course ID: 2500-PL-PS-SP15-13
Erasmus code / ISCED: 14.4 Kod klasyfikacyjny przedmiotu składa się z trzech do pięciu cyfr, przy czym trzy pierwsze oznaczają klasyfikację dziedziny wg. Listy kodów dziedzin obowiązującej w programie Socrates/Erasmus, czwarta (dotąd na ogół 0) – ewentualne uszczegółowienie informacji o dyscyplinie, piąta – stopień zaawansowania przedmiotu ustalony na podstawie roku studiów, dla którego przedmiot jest przeznaczony. / (0313) Psychology The ISCED (International Standard Classification of Education) code has been designed by UNESCO.
Course title: (unknown)
Name in Polish: Computational Social Science: An Introduction to Data Science
Organizational unit: Faculty of Psychology
Course groups:
ECTS credit allocation (and other scores): 4.00 Basic information on ECTS credits allocation principles:
  • the annual hourly workload of the student’s work required to achieve the expected learning outcomes for a given stage is 1500-1800h, corresponding to 60 ECTS;
  • the student’s weekly hourly workload is 45 h;
  • 1 ECTS point corresponds to 25-30 hours of student work needed to achieve the assumed learning outcomes;
  • weekly student workload necessary to achieve the assumed learning outcomes allows to obtain 1.5 ECTS;
  • work required to pass the course, which has been assigned 3 ECTS, constitutes 10% of the semester student load.
Language: English
Short description:

This course is an introduction to data science for social scientists. During the course students will learn, using practical examples, how new computational methods may be applied to social psychology and social sciences in general and how they can be used to study phenomena that are hard to track with traditional methods. After the course a student should know what the available tools are, how they work, and how they might be applied to answer questions social scientists may ask. The course will cover basic concepts of computational social science such as how to use external data sources (primarily web-based), most important web data formats, popular computational tools and environments, working with APIs, webscraping, and Natural Language Processing (NLP). Each topic will be illustrated with real-life examples, and students will have the possibility to not only learn basic concepts and see real-world applications but also apply the methods in practice working on very simple examples.

Learning outcomes:

By the end of the semester students should be able to:

 understand basic concepts of computational social science.

 communicate with data scientists / computer programmers etc. (using adequate vocabulary).

 understand advantages, challenges, and limitations of computational methods in social sciences.

 formulate research questions that can be addressed with computational methods and/or data extracted from existing web-based data sources.

 plan research using computational methods (especially webscraping, web API data extraction, and natural language processing)

 use materials from the course to scrap a website, work with a simple API, and perform basic Natural Language Processing.

Classes in period "Winter semester 2023/24" (past)

Time span: 2023-10-01 - 2024-01-28
Selected timetable range:
Navigate to timetable
Type of class:
Classes, 30 hours more information
Coordinators: (unknown)
Group instructors: Mikołaj Biesaga
Students list: (inaccessible to you)
Examination: Course - Grading
Classes - Grading
Full description:

The development of Internet and social media, opens the whole new world of possibilities for social scientists to track human behavior. The questions that were hard to handle using traditional methods of data collection now can be addressed. Furthermore, the new possibilities allow for the formulation of new questions and tracking phenomena, which were impossible to follow before. However, the new sources of information require from social scientists to work on the verge of social science and computer science. This new area is usually called computational social science. Therefore, social scientists need to learn what type of data is available out there and how to collect it. It does not necessarily mean that they need to learn computer science because they might cooperate with computer scientists, but at least they need to understand the basic concepts to be able to plan adequate research.

This course is an introduction to data science for social scientists, therefore, it will introduce basic concepts only. It will not cover advanced methods, techniques, and theories. During the course following topics will be introduced: available data sources, data formats, popular computational tools, and environments, working with APIs, webscraping, and Natural Language Processing (NLP). However, the focus will be not on the technical aspect but on the possible applications for social scientists. Each topic will be illustrated with real-life examples, and students will have the possibility to not only learn basic concepts but also apply the methods in practice working on very simple examples.

The hands-on workshops will be based on prepared scripts that will require only simple configuration from students, however, some notion of Python and R programming languages is required (for example, finished course on Programming in Python for Social Scientists and Introduction to Statistics in R).

At the end of the course students will be able to understand basic concepts of computational social science, communicate with data scientists / computer scientists using adequate vocabulary, and foremost formulate research questions that can be addressed with computational methods and/or data extracted from existing web-based data sources.

Bibliography:

Bibliography:

Main

 Vallacher, R. R., Read, S. J., & Nowak, A. (Eds.). (2017). Computational social psychology. Routledge.

 Kosinski, M., Stillwell, D., & Graepel, T. (2013). Private traits and attributes are predictable from digital records of human behavior. Proceedings of the National Academy of Sciences of the United States of America, 110(15), 5802–5805.

Supplementary

 Guttag, J.V. (2021). Introduction to Computation and Programming Using Python. The MIT Press

 Wickham, H., & Grolemund, G. (2016). R for data science: import, tidy, transform, visualize, and model data. O'Reilly Media, Inc.

 Wickham, H. (2014). Advanced r. Chapman and Hall/CRC.

 Silge, J., & Robinson, D. (2017). Text mining with R: A tidy approach. O'Reilly Media, Inc.

 Kosinski, M., Wang, Y., Lakkaraju, H., & Leskovec, J. (2016). Mining big data to extract patterns and predict real-life outcomes. Psychological methods, 21(4), 493--506.

Additional online resources and tools will be made available when appropriate during the course

Classes in period "Winter semester 2024/25" (future)

Time span: 2024-10-01 - 2025-01-26
Selected timetable range:
Navigate to timetable
Type of class:
Classes, 30 hours more information
Coordinators: (unknown)
Group instructors: Mikołaj Biesaga
Students list: (inaccessible to you)
Examination: Course - Grading
Classes - Grading
Full description:

The development of Internet and social media, opens the whole new world of possibilities for social scientists to track human behavior. The questions that were hard to handle using traditional methods of data collection now can be addressed. Furthermore, the new possibilities allow for the formulation of new questions and tracking phenomena, which were impossible to follow before. However, the new sources of information require from social scientists to work on the verge of social science and computer science. This new area is usually called computational social science. Therefore, social scientists need to learn what type of data is available out there and how to collect it. It does not necessarily mean that they need to learn computer science because they might cooperate with computer scientists, but at least they need to understand the basic concepts to be able to plan adequate research.

This course is an introduction to data science for social scientists, therefore, it will introduce basic concepts only. It will not cover advanced methods, techniques, and theories. During the course following topics will be introduced: available data sources, data formats, popular computational tools, and environments, working with APIs, webscraping, and Natural Language Processing (NLP). However, the focus will be not on the technical aspect but on the possible applications for social scientists. Each topic will be illustrated with real-life examples, and students will have the possibility to not only learn basic concepts but also apply the methods in practice working on very simple examples.

The hands-on workshops will be based on prepared scripts that will require only simple configuration from students, however, some notion of Python and R programming languages is required (for example, finished course on Programming in Python for Social Scientists and Introduction to Statistics in R).

At the end of the course students will be able to understand basic concepts of computational social science, communicate with data scientists / computer scientists using adequate vocabulary, and foremost formulate research questions that can be addressed with computational methods and/or data extracted from existing web-based data sources.

Bibliography:

Bibliography:

Main

 Vallacher, R. R., Read, S. J., & Nowak, A. (Eds.). (2017). Computational social psychology. Routledge.

 Kosinski, M., Stillwell, D., & Graepel, T. (2013). Private traits and attributes are predictable from digital records of human behavior. Proceedings of the National Academy of Sciences of the United States of America, 110(15), 5802–5805.

Supplementary

 Guttag, J.V. (2021). Introduction to Computation and Programming Using Python. The MIT Press

 Wickham, H., & Grolemund, G. (2016). R for data science: import, tidy, transform, visualize, and model data. O'Reilly Media, Inc.

 Wickham, H. (2014). Advanced r. Chapman and Hall/CRC.

 Silge, J., & Robinson, D. (2017). Text mining with R: A tidy approach. O'Reilly Media, Inc.

 Kosinski, M., Wang, Y., Lakkaraju, H., & Leskovec, J. (2016). Mining big data to extract patterns and predict real-life outcomes. Psychological methods, 21(4), 493--506.

Additional online resources and tools will be made available when appropriate during the course

Course descriptions are protected by copyright.
Copyright by University of Warsaw.
Krakowskie Przedmieście 26/28
00-927 Warszawa
tel: +48 22 55 20 000 https://uw.edu.pl/
contact accessibility statement USOSweb 7.0.3.0 (2024-03-22)