Yi-Ting Chen, "A Distributional Synthetic Control Method for Policy Evaluation," Journal of Applied Econometrics, Vol. 35, No. 5, 2020, pp. 505-525.

The zip file "ytc-files.zip" contains the datasets used in the empirical applications and the R code that generates the simulation and empirical results. Since the zip file contains both binary and ASCII files, Unix/Linux users should take care when unzipping so that line-ending conversion is applied only to the ASCII files.

The files are organized into the following folders:

[Data]
[Codes]/[Empirical]
       /[Simulation]

The files in each folder are described as follows.

=======================================
Part I. Data for empirical applications
=======================================

The folder [Data] includes the following five files:

1. county data.csv (county-level restaurant workers' earnings and employment)
2. US_state_comparison_1996.csv (the observed state features for Cases 1 & 2 in Section 5 of the paper)
3. US_state_comparison_1998.csv (the observed state features for Case 3 in Section 5 of the paper)
4. US_state_comparison_2004.csv (the observed state features for Case 4 in Section 5 of the paper)
5. StateData.zip (a zip file that includes the R script "State report.R" and the original data used to generate US_state_comparison_1996.csv and the other two related csv files)

Data sources:
-------------
1. "county data.csv" is a subset of the Stata data file "QCEWindustry_minwage_all.dta," which is contained in the replication data "minwage_dubeRESTATv2.zip" of the following reference:

   Arindrajit Dube; T. William Lester; Michael Reich, 2011, "Replication data for: Minimum Wage Effects across State Borders: Estimates Using Contiguous Counties", https://doi.org/10.7910/DVN/L4DUZ7, Harvard Dataverse, V2

2. "US_state_comparison_1996.csv" and the other two related csv files are generated from several public data sources. Please see the Word file "annual US data resource.docx" contained in "StateData.zip" for these data sources.

Data format of county data.csv:
-------------------------------
Sampling period: 1990:Q1-2006:Q2

Variables (columns):
 1. time
 2. statename (state name)
 3. countyreal (county FIPS code)
 4. countyname (county name)
 5. countypop2000 (county population, 2000)
 6. cntypopdens (county population density, 2000)
 7. cntyarea (county land area)
 8. federalmin (federal minimum wage)
 9. stminwage (state minimum wage)
10. minwage (combination of federal and state minimum wage)
11. empTOT (total employment)
12. emp_rest_both (restaurant employment, NAICS 7221 and NAICS 7222)
13. AWW_rest_both (restaurant average weekly wage, NAICS 7221 and NAICS 7222)
14. emp722 (food services and drinking places employment)
15. AWW722 (food services and drinking places average weekly wage)
16. empRETAIL (retail employment)
17. AWWRETAIL (retail average weekly wage)
18. empMFG (manufacturing employment)
19. AWWMFG (manufacturing average weekly wage)

Data format of US_state_comparison_1996.csv (and the other two csv files):
--------------------------------------------------------------------------
Variables (rows) for each state:
 1. Resident population
 2. White alone, percentage
 3. 65 years old and over, percentage
 4. Land area
 5. Population per square mile
 6. Persons below poverty level, percentage
 7. Personal income per capita
 8. Real median household income
 9. Annual nonfarm
10. New private housing units authorized by building permits

(Please see Section 5.1 of the paper for details.)

Note: Data for California's tobacco control program (CTCP)
----------------------------------------------------------
The supplementary appendix of the paper presents a case study of the CTCP, as considered in the following reference:

Abadie, A., Diamond, A., & Hainmueller, J. (2010). Synthetic control methods for comparative case studies: Estimating the effect of California's Tobacco Control Program, Journal of the American Statistical Association, 105, 493-505.

This case study requires the data file "synth_smoking.csv", which is the csv version of the file "synth_smoking.dta" downloaded from
http://fmwww.bc.edu/repec/bocode/s/synth_smoking.dta

===========================================
Part II. R Codes for Empirical applications
===========================================

The folder [Codes]/[Empirical] includes the following four files:

1. LoadData_and_DefineEvents.R (Figures A.8-A.11, Table A.3 and Section 2.2.3 of the Appendix)
2. Distributional_Synthetic_Control_Analysis.R (This script implements the four case studies. It generates the potential control units, the synthetic controls, and the placebo synthetic controls for the earnings effects (wage effects) and the employment effects.)
3. Present_Empirical_Results.R (Figures 2-7, Figures A.12-A.13, Tables A.4-A.5)
4. Empirical_CTCP.R (Figures A.14-A.17)

Note: Files #1-#3 are for the case studies presented in the paper. File #4 is for the case study presented in the supplementary appendix.

================================
Part III. R Codes for Simulation
================================

The folder [Codes]/[Simulation] includes the following two files:

1. Simulation.R
2. Present_Simulation_Results (Figure 1, Figures A.1-A.7, Tables A.1-A.2)
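
Note: Checking the layout of county data.csv
--------------------------------------------
Before running the empirical scripts, it may help to confirm that a local copy of "county data.csv" has the 19 columns listed in Part I. The R sketch below is not part of the archive. The helper name missing_county_cols is hypothetical, and the sketch assumes that the csv header uses exactly the variable names given in Part I. The demonstration runs on a temporary one-row file, so it does not need the real data.

```r
# Column names as listed in Part I of this readme (assumed to match the
# csv header exactly; this is an assumption, not confirmed by the archive).
expected_cols <- c(
  "time", "statename", "countyreal", "countyname", "countypop2000",
  "cntypopdens", "cntyarea", "federalmin", "stminwage", "minwage",
  "empTOT", "emp_rest_both", "AWW_rest_both", "emp722", "AWW722",
  "empRETAIL", "AWWRETAIL", "empMFG", "AWWMFG"
)

# Hypothetical helper (not part of the archive): returns the expected
# column names that are absent from the header of the csv file at `path`.
missing_county_cols <- function(path) {
  header <- names(read.csv(path, nrows = 1, check.names = FALSE))
  setdiff(expected_cols, header)
}

# Demonstration on a temporary one-row file with the expected header.
demo_row <- as.data.frame(
  as.list(setNames(rep(NA, length(expected_cols)), expected_cols))
)
demo_path <- tempfile(fileext = ".csv")
write.csv(demo_row, demo_path, row.names = FALSE)

# An empty result means every expected column was found.
demo_missing <- missing_county_cols(demo_path)
print(demo_missing)
```

With the archive unpacked, the same check would be missing_county_cols("Data/county data.csv"), where the relative path is an assumption about where the zip file was extracted.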