This is part of an extended look at the NYC School Arts Survey:
Part One: Arts Education Liaisons
Part Two: Arts Education Supervisors
The survey had several questions regarding which resources schools are devoting to arts education, whether the administrator thought they were sufficient and which non-Department of Education sources of support were available to them.
Schools were asked how many rooms they have dedicated to arts education. Rooms were divided into two categories: “rooms designed and used solely for the arts” and “multi-purpose or general education classroooms used for arts education”. Five arts disciplines were considered: dance, music, theater, visual arts, and media arts (a category which includes film and photography). Note that the term media arts refers to film programs.
Number of rooms is certainly not a direct metric for a school’s commitment to the arts. I hypothesize that it may prove useful for assessing the resources that are made available for arts education more broadly. Of course, there may be confounding factors, such as school size and borough.
All disciplines show the same trend; many schools have zero to two rooms for a given discipline and a few schools have more than twenty rooms. Having twenty-some rooms might be bordering on unrealistic, but the above plots depict both categories of rooms, those designed solely for the arts and multi-purpose. Will we get more reasonable numbers by examining the room counts separately?
Looking at the number of rooms designed solely for the arts, the maximum number of rooms is a more reasonable thirteen. For a school with several hundred students and a dedicated arts program I can picture that. If we look at multi-purpose rooms, we see that this class contains the bulk of the rooms. Not many schools have dozens of multi-purpose rooms for arts disciplines, but again, in the context of a large school, it seems like a possible number for some schools to have.
I would be curious to discover the overlap for multi-purpose rooms among arts disciplines. In other words, is a school with twenty multi-purpose rooms reporting some of those rooms as in use for multiple arts discipline? I do not see any way to completely correct for that possibility, but it would be interesting to control for school size to try to get to the bottom of it. If this does not work, then we could consider only the number of rooms designed solely for art, but this could penalize small schools unduly.
##
## Call:
## lm(formula = perc_34_all_2018_ela ~ ., data = .)
##
## Residuals:
## Min 1Q Median 3Q Max
## -38.706 -15.286 -3.482 13.757 57.410
##
## Coefficients:
## Estimate Std. Error t value Pr(>|t|)
## (Intercept) 37.172301 1.540811 24.125 < 2e-16 ***
## rm_dance 0.198147 0.203517 0.974 0.331
## rm_music -0.019724 0.170177 -0.116 0.908
## rm_thtr -0.046773 0.195201 -0.240 0.811
## rm_visart 0.186354 0.137639 1.354 0.176
## rm_media -0.141977 0.162441 -0.874 0.382
## total_enrollment_2017 0.014001 0.002242 6.245 7.33e-10 ***
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
##
## Residual standard error: 19.62 on 706 degrees of freedom
## (552 observations deleted due to missingness)
## Multiple R-squared: 0.06461, Adjusted R-squared: 0.05666
## F-statistic: 8.127 on 6 and 706 DF, p-value: 1.642e-08
##
## Call:
## lm(formula = perc_34_all_2018_math ~ ., data = .)
##
## Residuals:
## Min 1Q Median 3Q Max
## -44.704 -17.891 -4.972 15.429 60.377
##
## Coefficients:
## Estimate Std. Error t value Pr(>|t|)
## (Intercept) 30.379442 1.751426 17.346 < 2e-16 ***
## rm_dance 0.187916 0.231335 0.812 0.4169
## rm_music 0.142808 0.193439 0.738 0.4606
## rm_thtr 0.048594 0.221883 0.219 0.8267
## rm_visart 0.258630 0.156453 1.653 0.0988 .
## rm_media -0.206618 0.184645 -1.119 0.2635
## total_enrollment_2017 0.018216 0.002549 7.147 2.21e-12 ***
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
##
## Residual standard error: 22.3 on 706 degrees of freedom
## (552 observations deleted due to missingness)
## Multiple R-squared: 0.09644, Adjusted R-squared: 0.08876
## F-statistic: 12.56 on 6 and 706 DF, p-value: 1.747e-13
Conditioning on the effect of school size, through total enrollment, there is no statistically significant effect for the total number of rooms for arts education on student academic performance.
##
## Call:
## lm(formula = perc_34_all_2018_ela ~ ., data = .)
##
## Residuals:
## Min 1Q Median 3Q Max
## -39.233 -15.704 -3.011 13.495 54.952
##
## Coefficients:
## Estimate Std. Error t value Pr(>|t|)
## (Intercept) 36.98124 1.56537 23.625 < 2e-16 ***
## Q8_R1_C1 -1.92913 1.36332 -1.415 0.157452
## Q8_R2_C1 3.30383 0.91647 3.605 0.000332 ***
## Q8_R3_C1 0.92597 1.04167 0.889 0.374312
## Q8_R4_C1 0.16888 1.18807 0.142 0.886998
## Q8_R5_C1 -1.84882 1.00594 -1.838 0.066449 .
## total_enrollment_2017 0.01312 0.00221 5.938 4.3e-09 ***
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
##
## Residual standard error: 19.51 on 797 degrees of freedom
## (461 observations deleted due to missingness)
## Multiple R-squared: 0.08419, Adjusted R-squared: 0.0773
## F-statistic: 12.21 on 6 and 797 DF, p-value: 3.603e-13
##
## Call:
## lm(formula = perc_34_all_2018_math ~ ., data = .)
##
## Residuals:
## Min 1Q Median 3Q Max
## -39.170 -17.415 -4.067 14.767 58.385
##
## Coefficients:
## Estimate Std. Error t value Pr(>|t|)
## (Intercept) 31.887968 1.772957 17.986 < 2e-16 ***
## Q8_R1_C1 -3.390478 1.544105 -2.196 0.0284 *
## Q8_R2_C1 2.964480 1.037997 2.856 0.0044 **
## Q8_R3_C1 0.795531 1.179807 0.674 0.5003
## Q8_R4_C1 -0.890543 1.345613 -0.662 0.5083
## Q8_R5_C1 -2.400571 1.139337 -2.107 0.0354 *
## total_enrollment_2017 0.019294 0.002503 7.707 3.82e-14 ***
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
##
## Residual standard error: 22.1 on 797 degrees of freedom
## (461 observations deleted due to missingness)
## Multiple R-squared: 0.104, Adjusted R-squared: 0.09722
## F-statistic: 15.41 on 6 and 797 DF, p-value: < 2.2e-16
Controlling for school size, through total student enrollment, we can see a statistically significant (p-value less than 0.001) positive coefficient for the effect of additional rooms designed and used solely for music, on ELA state test scores. If we consider math scores, rooms solely dedicated to music still have a positive effect, but at a 0.0044 p-value. No other arts disciplines have a statistically significant effect on ELA scores, but dance and media arts rooms have statistically significant (p-value less than 0.05) negative effects on math scores.
When we consider multi-purpose rooms, no arts discipline has a statistically significant effect on academic performance.
I am curious about the reason behind why music rooms appear to be associated with higher test scores, and why dance and media arts are associated with lower scores. To investigate we can try to control for different features and divide the performance metric by grade.
## [1] "perc_34_5_2018_ela"
##
## Call:
## lm(formula = f.ela, data = .)
##
## Residuals:
## Min 1Q Median 3Q Max
## -37.072 -16.524 -3.894 15.154 65.915
##
## Coefficients:
## Estimate Std. Error t value Pr(>|t|)
## (Intercept) 30.393260 2.172697 13.989 < 2e-16 ***
## total_enrollment_2017 0.010224 0.002857 3.579 0.000375 ***
## Q8_R1_C1 -1.070094 1.677094 -0.638 0.523694
## Q8_R2_C1 3.047105 1.061100 2.872 0.004239 **
## Q8_R3_C1 -0.076777 1.189206 -0.065 0.948546
## Q8_R4_C1 -0.120480 1.501049 -0.080 0.936057
## Q8_R5_C1 -0.440556 1.291528 -0.341 0.733148
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
##
## Residual standard error: 20.13 on 556 degrees of freedom
## Multiple R-squared: 0.04519, Adjusted R-squared: 0.03489
## F-statistic: 4.386 on 6 and 556 DF, p-value: 0.000242
##
## [1] "perc_34_6_2018_ela"
##
## Call:
## lm(formula = f.ela, data = .)
##
## Residuals:
## Min 1Q Median 3Q Max
## -41.030 -16.090 -3.834 12.920 55.424
##
## Coefficients:
## Estimate Std. Error t value Pr(>|t|)
## (Intercept) 32.107149 2.400189 13.377 < 2e-16 ***
## total_enrollment_2017 0.019201 0.003511 5.469 9.17e-08 ***
## Q8_R1_C1 -0.386109 2.053572 -0.188 0.85098
## Q8_R2_C1 2.890346 1.386475 2.085 0.03789 *
## Q8_R3_C1 3.378799 2.188257 1.544 0.12357
## Q8_R4_C1 1.694171 1.881632 0.900 0.36860
## Q8_R5_C1 -4.765419 1.542141 -3.090 0.00218 **
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
##
## Residual standard error: 20.26 on 319 degrees of freedom
## Multiple R-squared: 0.1908, Adjusted R-squared: 0.1756
## F-statistic: 12.53 on 6 and 319 DF, p-value: 1.08e-12
## [1] "perc_34_5_2018_math"
##
## Call:
## lm(formula = f.math, data = .)
##
## Residuals:
## Min 1Q Median 3Q Max
## -42.241 -19.010 -3.535 16.810 65.205
##
## Coefficients:
## Estimate Std. Error t value Pr(>|t|)
## (Intercept) 30.370138 2.450852 12.392 < 2e-16 ***
## total_enrollment_2017 0.015948 0.003225 4.944 1.01e-06 ***
## Q8_R1_C1 -1.021941 1.892332 -0.540 0.589384
## Q8_R2_C1 3.959590 1.195636 3.312 0.000988 ***
## Q8_R3_C1 -0.436070 1.340461 -0.325 0.745066
## Q8_R4_C1 -1.058457 1.691447 -0.626 0.531723
## Q8_R5_C1 -0.904146 1.456523 -0.621 0.535015
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
##
## Residual standard error: 22.68 on 555 degrees of freedom
## Multiple R-squared: 0.07056, Adjusted R-squared: 0.06051
## F-statistic: 7.023 on 6 and 555 DF, p-value: 3.232e-07
##
## [1] "perc_34_6_2018_math"
##
## Call:
## lm(formula = f.math, data = .)
##
## Residuals:
## Min 1Q Median 3Q Max
## -48.190 -16.714 -4.699 10.401 67.053
##
## Coefficients:
## Estimate Std. Error t value Pr(>|t|)
## (Intercept) 18.231937 2.592891 7.032 1.25e-11 ***
## total_enrollment_2017 0.021830 0.003808 5.733 2.29e-08 ***
## Q8_R1_C1 -0.022742 2.227755 -0.010 0.99186
## Q8_R2_C1 3.955813 1.502217 2.633 0.00887 **
## Q8_R3_C1 2.114419 2.368073 0.893 0.37259
## Q8_R4_C1 2.738886 2.038702 1.343 0.18008
## Q8_R5_C1 -4.635148 1.673390 -2.770 0.00594 **
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
##
## Residual standard error: 21.97 on 319 degrees of freedom
## Multiple R-squared: 0.218, Adjusted R-squared: 0.2033
## F-statistic: 14.82 on 6 and 319 DF, p-value: 5.957e-15
If we break down academic performance by grade, then the results are not as straigtforward. For sixth-, seventh- and eighth-graders’ ELA performance, media arts rooms have a more statistically significant negative coefficient than the positive coefficient for music rooms. All grades show a positive coefficient for music rooms, with a p-value no more than 0.01. There is a large jump in media arts between fifth- and sixth-grades, which is the demarcation between elementary and middle schools. A jump of this nature could have an underlying reason, beyond increased susceptibility to media arts programs at the expense of academic performance beginning in middle school. Perhaps not many elementary schools have media arts programs at all, or they are more commonly found in schools that are otherwise performing at an atypical level (higher or lower). Both of these can be checked.
Math scores show a similar trend to ELA scores, with most grades showing a statistically significant positive coefficient for music rooms. Among middle schools, media arts rooms have a statistically significant negative coefficient. The key difference with ELA scores is that there is not a statistically significant positive coefficient for music rooms on eighth-grade math scores.
I do not see a significant difference in the distribution of art rooms between elementary and middle schools. Music and visual arts rooms are the most common. Let’s look at the performance of schools with media arts programs.
We can match schools on having media arts rooms while controlling for number of students, number of students in each grade, and perhaps other demographic and academic features. Then we can examine the effect of media arts rooms on academic performance.
##
## Call:
## matchit(formula = f, data = temp, method = "nearest", distance = "logit")
##
## Summary of balance for all data:
## Means Treated Means Control SD Control Mean Diff
## distance 0.4353 0.4066 0.0652 0.0287
## perc_34_all_2018_ela 46.6335 46.3189 20.4054 0.3147
## total_enrollment_2017 655.2789 582.9615 325.9929 72.3174
## grd_1_2017 61.4113 66.4686 56.7962 -5.0573
## grd_2_2017 62.2197 67.0081 57.3009 -4.7884
## grd_3_2017 64.3408 69.3570 60.2137 -5.0162
## grd_4_2017 64.3634 68.6998 62.2163 -4.3364
## grd_5_2017 64.7155 68.8499 63.3577 -4.1344
## grd_6_2017 76.4901 42.7688 82.3900 33.7214
## grd_7_2017 77.1099 42.4604 83.9690 34.6494
## grd_8_2017 77.1690 42.1339 84.4076 35.0351
## eQQ Med eQQ Mean eQQ Max
## distance 0.0101 0.0291 0.192
## perc_34_all_2018_ela 1.1000 1.3214 11.100
## total_enrollment_2017 59.0000 74.1915 238.000
## grd_1_2017 4.0000 5.0338 53.000
## grd_2_2017 3.0000 4.8113 48.000
## grd_3_2017 3.0000 4.9746 49.000
## grd_4_2017 3.0000 5.2761 40.000
## grd_5_2017 5.0000 5.0423 42.000
## grd_6_2017 0.0000 34.1662 253.000
## grd_7_2017 0.0000 35.1944 281.000
## grd_8_2017 0.0000 35.6197 254.000
##
##
## Summary of balance for matched data:
## Means Treated Means Control SD Control Mean Diff
## distance 0.4353 0.4178 0.0721 0.0175
## perc_34_all_2018_ela 46.6335 45.0045 19.9245 1.6290
## total_enrollment_2017 655.2789 616.0394 337.3379 39.2394
## grd_1_2017 61.4113 65.1183 58.3054 -3.7070
## grd_2_2017 62.2197 65.9211 59.5015 -3.7014
## grd_3_2017 64.3408 67.9239 62.3438 -3.5831
## grd_4_2017 64.3634 68.7352 64.3851 -4.3718
## grd_5_2017 64.7155 69.4873 65.0190 -4.7718
## grd_6_2017 76.4901 53.9972 92.9389 22.4930
## grd_7_2017 77.1099 54.0197 95.1002 23.0901
## grd_8_2017 77.1690 53.8000 95.7602 23.3690
## eQQ Med eQQ Mean eQQ Max
## distance 0.0008 0.0177 0.1721
## perc_34_all_2018_ela 1.5000 2.1676 11.1000
## total_enrollment_2017 31.0000 41.0648 171.0000
## grd_1_2017 3.0000 3.8085 53.0000
## grd_2_2017 2.0000 3.9324 48.0000
## grd_3_2017 2.0000 4.1183 60.0000
## grd_4_2017 3.0000 5.2338 59.0000
## grd_5_2017 4.0000 5.3296 42.0000
## grd_6_2017 0.0000 22.6901 223.0000
## grd_7_2017 0.0000 23.3718 253.0000
## grd_8_2017 0.0000 23.8366 219.0000
##
## Percent Balance Improvement:
## Mean Diff. eQQ Med eQQ Mean eQQ Max
## distance 38.9856 91.9854 39.1348 10.3680
## perc_34_all_2018_ela -417.7110 -36.3636 -64.0375 0.0000
## total_enrollment_2017 45.7400 47.4576 44.6503 28.1513
## grd_1_2017 26.6991 25.0000 24.3425 0.0000
## grd_2_2017 22.7004 33.3333 18.2670 0.0000
## grd_3_2017 28.5688 33.3333 17.2140 -22.4490
## grd_4_2017 -0.8167 0.0000 0.8009 -47.5000
## grd_5_2017 -15.4176 20.0000 -5.6983 0.0000
## grd_6_2017 33.2976 0.0000 33.5889 11.8577
## grd_7_2017 33.3607 0.0000 33.5921 9.9644
## grd_8_2017 33.2984 0.0000 33.0803 13.7795
##
## Sample sizes:
## Control Treated
## All 493 355
## Matched 355 355
## Unmatched 138 0
## Discarded 0 0
Propensity score matching improves balance for most features.
##
## Welch Two Sample t-test
##
## data: perc_34_all_2018_ela by rm_ded_media
## t = -1.0661, df = 706.75, p-value = 0.2867
## alternative hypothesis: true difference in means is not equal to 0
## 95 percent confidence interval:
## -4.629010 1.370982
## sample estimates:
## mean in group 0 mean in group 1
## 45.00451 46.63352
##
## Call:
## lm(formula = perc_34_all_2018_ela ~ rm_ded_media, data = m.data1)
##
## Residuals:
## Min 1Q Median 3Q Max
## -45.005 -16.587 -3.434 15.588 53.895
##
## Coefficients:
## Estimate Std. Error t value Pr(>|t|)
## (Intercept) 45.005 1.080 41.653 <2e-16 ***
## rm_ded_media 1.629 1.528 1.066 0.287
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
##
## Residual standard error: 20.36 on 708 degrees of freedom
## Multiple R-squared: 0.001603, Adjusted R-squared: 0.0001926
## F-statistic: 1.137 on 1 and 708 DF, p-value: 0.2867
##
## Call:
## lm(formula = perc_34_all_2018_ela ~ ., data = .)
##
## Residuals:
## Min 1Q Median 3Q Max
## -125.283 -6.967 -0.691 6.486 46.678
##
## Coefficients:
## Estimate Std. Error t value Pr(>|t|)
## (Intercept) 6.781e+02 2.010e+01 33.738 < 2e-16 ***
## rm_ded_media 1.388e+00 9.212e-01 1.507 0.132
## total_enrollment_2017 2.824e-02 4.520e-03 6.248 7.23e-10 ***
## grd_1_2017 -4.954e-01 4.753e-02 -10.423 < 2e-16 ***
## grd_2_2017 1.219e+00 6.042e-02 20.167 < 2e-16 ***
## grd_3_2017 -1.194e+00 5.417e-02 -22.045 < 2e-16 ***
## grd_4_2017 4.809e-01 4.566e-02 10.533 < 2e-16 ***
## grd_5_2017 5.800e-01 4.021e-02 14.424 < 2e-16 ***
## grd_6_2017 2.189e-01 2.560e-02 8.549 < 2e-16 ***
## grd_7_2017 -7.749e-04 3.879e-02 -0.020 0.984
## grd_8_2017 1.275e+00 5.267e-02 24.206 < 2e-16 ***
## pscore -1.846e+03 5.745e+01 -32.124 < 2e-16 ***
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
##
## Residual standard error: 12.2 on 698 degrees of freedom
## Multiple R-squared: 0.6466, Adjusted R-squared: 0.6411
## F-statistic: 116.1 on 11 and 698 DF, p-value: < 2.2e-16
Matching suggests that rooms dedicated to media arts are associated with schools with higher ELA scores. The result that led to this investigation was that rooms dedicated to media arts are associated with lower state test scores among middle-schoolers. In light of this analysis I would not draw any firm conclusions, but would be interested to find some instrumental variable to use in lieu of an experiment.
The 2017-2018 Arts Survey Data has data about arts teachers, budgets, partnerships with cultural organizations and parental involvement in NYC public schools.
In an effort to gain greater context for this data, we can examine it in conjuction with publicly available ELA and Math state test results and demographic data.
My goals were to understand the state of arts programs in NYC schools, what variables affect the resources of arts programs, and whether arts programs have an effect on the academic performance of students.