[OQSPS] Inequality among 32 London Boroughs: An S factor analysis  Printable Version + OpenPsych forums (https://www.openpsych.net/forum) + Forum: Forums (https://www.openpsych.net/forum/forumdisplay.php?fid=1) + Forum: Postreview discussions (https://www.openpsych.net/forum/forumdisplay.php?fid=5) + Thread: [OQSPS] Inequality among 32 London Boroughs: An S factor analysis (/showthread.php?tid=249) Pages:
1
2

[OQSPS] Inequality among 32 London Boroughs: An S factor analysis  Emil  2015Sep23 Moved this submission to the submission forum, now that we have a journal and a review team. Emil I have another paper I don't know where to send to. I will post it here for now and try to get some reviewers to comment here. Title Inequality among 32 London Boroughs: An S factor analysis Abstract A dataset of 30 diverse socioeconomic variables was collected covering 32 London boroughs. Factor analysis of the data revealed a general socioeconomic factor. This factor was strongly related to GCSE scores (r's .813 to .819) and and had weak to medium sized negative relationships to demographic variables related to immigrants (r's .224 to .489). Jensen's method indicated that these relationships was related to the underlying general factor, especially for GCSE (Jensen coefficients .67 to .84, and .45 to .60). Key words: general socioeconomic factor, S factor, inequality, London, boroughs, United Kingdom, cognitive ability, IQ, intelligence, scholastic ability, GCSE, immigrants PDF and files: https://osf.io/p6fwh/ Publishing tweet: https://twitter.com/KirkegaardEmil/status/646598876032425984 RE: [sociology] Inequality among 32 London Boroughs: An S factor analysis  NoahCarl  2015Sep23 This paper analyses educational, demographic and socioeconomic data from 32 London boroughs. It derives a general socioeconomic factor, and confirms that this factor is correlated in the expected direction with the educational and demographic variables. The analyses are appropriate, and appear to support the conclusions enunciated in the text. In addition, the paper is clearly written, and adequately referenced. Therefore, I believe it is ready to published. I would, however, offer the following suggestions to the author: 1. Consider relabelling variables so that their names are easier to read (e.g., deleting underscores). 2. Consider including a horizontal line in Table 1, in order to separate variable names from the reported correlations. 3. Note that '% 5+ A*C with Eng. and Math.' is still a somewhat blunt measure of cognitive ability, insofar as: the exact grades are not specified, so a borough with X% getting 5+ Cs could theoretically obtain the same percentage as a borough with X% getting 5+ A*s; three of the five subjects are not specified, so could be either comparatively easy (e.g., Media Studies), or comparatively difficult (e.g., Physics); many children in Britain take 10 or more GCSEs. 4. Note that one cannot draw strong conclusions about the aggregatelevel correlation between educational achievement and cognitive ability across London boroughs from the aggregatelevel correlation between these two variables across countries. Reply to Noah #2  Emil  2015Sep24 Noah, Thanks for the review. 1) R does not like it when variable names have spaces (even in data.frames), so I have used underscores. One could also use CamelCase instead, but I think underscores are more readable. My preference is to keep the code consistent with the paper (for those who wish to analyze it more closely) over that of a slightly more polished presentation. 2) I have added borders to the table to make it more readable. 3) The achievement measure is clearly not optimal, but one has to make do with what there is. Perhaps one can find a better measure. You seem familiar with the data, could you perhaps tell me whether one of these measures are better? http://data.london.gov.uk/dataset/gcseresultslocationpupilresidenceborough It looks like there are several measures one could perhaps factor analyze or otherwise combine to get a composite measure: a) All Pupils at the End of KS4 Achieving 5+ A*  C b) All Pupils at the End of KS4 Achieving 5+ A*  G c) All Pupils at the End of KS4 Achieving 5+ A*  C Including English and Mathematics d) All Pupils at the End of KS4 Achieving 5+ A*  G Including English and Mathematics e) All Pupils at the End of KS4 Achieving the Basics f) All Pupils at the End of KS4 Entering the English Baccalaureate g) All pupils at the End of KS4 Achieving the English Baccalaureate h) Average GCSE and Equivalent Point Score Per Pupil at the End of KS4 i) Average Capped GCSE and Equivalent Point Score Per Pupil at the End of KS4 As far as I can tell, many of these are threshold versions of more continuous variables (cf. http://www.lagriffedulion.f2s.com/adverse.htm). Such variables have somewhat nonlinear relationships. Perhaps (h) is the best variable to use? It looks like a mean score type variable, meaning that no threshold transformation has been applied to it. I did analyze them. The currently used variable © has a factor loading of .96, but the loadings of all the variables are in the .69.98 range, so it would probably not matter so much. The highest loading is (i). In fact, because we have 9 variables all measuring scholastic ability, one can use Jensen's method. The prediction being that the variables that better measure scholastic ability should show higher correlations with the criteria variable (S). This was in fact found, r's .94.95 (depending on which S score vector was used). All correlations between GCSE variables and S were substantial r's .582 to .886. The strongest correlation is with (h) as one could expect because it is the underlying continuous variable. (i) seems to be some capped (?) version of this, which introduces a ceiling effect. So it seems to me that one should use (h). It seems somewhat unnecessary to include all this in the main part of the paper. Perhaps add an appendix discussing the GCSE variables and the above analysis? Let me know what you would prefer. 4) I agree. I changed it to: Quote:In line with much other research (15,16), one would expect higher cognitive ability to lead to higher S. The GCSE grades are not exactly an IQ test (17,18), but it has been found that at the nationallevel, scholastic ability and cognitive ability as measured by traditional IQ tests are nearly perfectly correlated (19,20). This suggests that it may also be a useful proxy at the boroughlevel, but this may not be the case. Prior research using similar data has found strong relationships between scholastic/ability ability and S, so a correlation in the vicinity of .40 to .90 would be expected here.  Files updated. RE: [sociology] Inequality among 32 London Boroughs: An S factor analysis  NoahCarl  2015Sep24 All changes proposed are fine. In regard to 3), I agree that (h) seems like the best overall measure of cognitive ability among those available. A short appendix discussing the different GCSE variables would suffice. RE: [sociology] Inequality among 32 London Boroughs: An S factor analysis  Emil  2015Sep25 Noah, I have uploaded a new version. It now has an appendix dealing with the GCSE stuff, 2 scatter plots of the main findings (GCSE x S, BAME x S), a brief analysis of mediation as well as the other changes discussed above. Files updated at OSF. RE: [sociology] Inequality among 32 London Boroughs: An S factor analysis  Emil  2015Sep25 I have asked Kenya Kura to review this. RE: [sociology] Inequality among 32 London Boroughs: An S factor analysis  Kenya Kura  2015Sep28 As I have read this manuscript, it seems to have been fully completed for publication without further analysis. Statistical analyses are enough sophisticated (e.g., Figure 1) and the results are very robust and consistent with previous findings like in Boston. Hereafter, let me just state two of my impressions of the findings. 1. The title of the Figure 4 should be “Scatter plot of S and Pct_BAME”. I found this relationship to be apparently weaker than the correlation in Figure 3, which makes a lot of sense because S and GCSE are stats from all students (or people in fact) including British gentiles. This relation may not as strong as the international S factor but should be fairly strong as the cases in Italy, Spain, or Japan with similar northsouth gradients. On the other hand, as the author acknowledges, the relation between S and Pct_BAME came from extremely diverse immigrant samples, including Scandinavians, who are very close to British people, to SubSaharan countries, Pakistan, India and China, who are very far at Fst level and also many different kinds of selection processes/pressures should have been existed. This also seems to be true for MCV of these two correlations in Figure 5, and 6. 2. The reason why S and female wage rate has a negative correlation is a puzzle. I doubt if women in affluent districts are not as eager to make money as those in poorer districts. For example, when husband earns a lot more, their (assortativemated) wives rather wants to be housewives and/or feel less obliged to work long hours or work in high paying jobs and so on. There may be a nonlinear relationship in this case. This is just my guess. It has been continuously found in a surprisingly consistent manner that S exists among human populations as the metafactor of the socioeconomic variables. I am afraid that they may be too inconvenient findings to acknowledge for the present time, when so many refugees desperately need help. RE: [sociology] Inequality among 32 London Boroughs: An S factor analysis  Emil  2015Sep30 Ken, Thank you for taking the time to review this. I have fixed the error with Figure 4. Furthermore, I reran the code. The splithalf factor reliability was lower in the rerun. Then I reran it with a 10x larger sample size (N=5000), giving a value of .79. I updated the paper accordingly. https://osf.io/f4uc2/files/ RE: [sociology] Inequality among 32 London Boroughs: An S factor analysis  Emil  2015Oct03 I have another dataset covering the same units. The dataset contains crime data for about 30 types of time given in a unusable format. I have converted it to a useful format and calculate per capita measures. This results in 60 variables. I have substantially rewritten the draft because of this. Results with regards to GCSE and immigrant variables were mostly unchanged. I have also added multiple regression fitting results (best subsets and lasso). Files updated: https://osf.io/f4uc2/files/ RE: [OQSPS] Inequality among 32 London Boroughs: An S factor analysis  ljzigerell  2016Jan22 The manuscript is good. I have attached a file with some grammar edits and comments. I approve the manuscript after those edits are addressed. A few notes that are not necessary to address for my approval: 1. I'm not sure it's necessary to add the Japan Sfactor results, but that's at least something to consider. 2. It might be worth considering the legibility of graphs. The text on the yaxis for Figure 4 overlaps slightly; Figures 5 and 6 have labels that fall off the graph or are on top of each other; and Figures 7 and 8 have a lot of labels on top of each other. If the labels aren't necessary for Figures 7 and 8, it might be best to exclude the labels; otherwise, I'm not sure of a solution. 3. The unexpected result for female pay might reflect a positive outcome. The dataset had a gross annual pay variable that was not disaggregated by sex, so maybe that would be a better measure in the future, if it's not theoretically clear that lower pay for one sex would be a negative outcome (because of, for instance, the assortive mating that Kenya mentioned). 