Instructor: Itamar Caspi
Teaching assistant: Ariel Karlinsky
🗓️ Second semester, 2022-2023 ⏰ 16:30 - 19:15 🏨 Social Sciences 2202 ✍️ github.com/ml4econ 👪 Moodle Discussion Forum
This course encompasses a wide range of topics including data science, machine learning, and econometrics. Its primary objective is to acquaint students with the fundamental concepts of machine learning that can potentially enhance empirical economics. The course introduces popular supervised and unsupervised machine learning methods, while emphasizing the challenges and opportunities of incorporating these methods in empirical economics. Additionally, it highlights the relevance of machine learning to policy analysis and causal inference. The course utilizes real-world applications, empirical articles, and hands-on assignments to illustrate various topics.
During the course, attendees will gain proficiency in the following:
-
Implementing data science best-practices in the context of empirical research within economics.
-
Acquiring comprehensive and pragmatic knowledge of the obstacles and prospects encountered in applied empirical work utilizing high-dimensional data.
-
Assimilating techniques and insights from the field of machine learning into applied empirical research conducted in economics.
It is anticipated that participants in the course will:
-
Possess their own computers with R, RStudio (Posit), Git, and GitHub Desktop installed.
-
Register for a GitHub and Kaggle account, which are available at no cost.
The following schedule is subject to change based on class interests and time constraints. Our pace will be guided by class discussions, and we kindly request that you check this page regularly for updates.
Week | Topic |
---|---|
1 | Course Overview |
2 | Basic ML Concepts |
3 | Reproducibility & ML Workflow |
4 | Regression and Regularization |
5 | Classification |
6 | Trees and Forests |
7 | Causal inference |
8 | High-Dimensional Counfounding Adjustment |
9 | High-Dimensional Heterogeneous Treatment Effects |
10 | Text Analysis |
11 | Large Language Models |
-
Regression and Regularization (HTML) (PDF)
5.1 Prepare browser data
5.2 Ridge and lasso simulation
5.3 Ridge, lasso, PCR and PLS: A Tidymodels Workflow -
Classification (HTML) (PDF)
6.1Classification: A Tidymodels workflow
TBA
A. Kaggle competition (HTML)
(PDF)
B.Replication Project Example (HTML)
Can be found here.
-
Itamar Caspi is Head of the Monetary Analysis Unit in the Research Department of the Bank of Israel, and an adjunct lecturer at the Hebrew University, having started off in 2010 as an Economist at the Ministry of Finance. In 2012 he moved to the Bank of Israel, and was promoted in 2018 to Senior Economist and elected to represent the Bank as a Research Fellow for three months at the Bank for International Settlements in Switzerland. He holds a BA in Economics and Business Administration from Ben-Gurion University, MA in Economics from the joint research program at Hebrew University and Tel-Aviv University, and a PhD in economics from Bar-Ilan University.
-
Ariel Karlinsky is an Economist and Statistician, a PHD Student in Economics at the Hebrew University.He mostly does empirical work, my primary research interests are Labor Economics, Data Science, Economics of Conflict, Health Economics, Economics of Israel and Economic History. Ariel is the creator and maintainer of the World Mortality Dataset, an open dataset that tracks all-cause-mortality data from around the world. I’m a member of the World Health Organization Technical Advisory Group on COVID-19 Mortality Assessment. member of The Israeli Economic History Association and Midaat, an NGO dedicated to promoting public health via evidence based medicine and practices.
This work is licensed under a Creative Commons Attribution 4.0 International License.