The 15 Most Important Degrees for Entrepeneurship

John Vandivier

While conducting <a href="https://github.com/Vandivier/data-science-practice/tree/master/stata-practice">an EDA of the ACS PUMS data set, one interesting thing I encountered was the relationship between the undergraduate degree a person received and their rate of self-employment. This article discusses that finding.

Two variables regressed include FOD1P and FOD2P which are variables for the "Field of Degree." I believe the degree referred to is the Bachelor's Degree as per the <a href="http://www2.census.gov/programs-surveys/acs/methodology/questionnaires/2014/quest14.pdf">ACS questionnaire which allows an individual to name multiple field in question 12. Statistically, after correcting for a number of variables the FOD2P variable was insignificant.

Most individuals only have one undergraduate degree and/or one major field for their undergraduate degree. However, a significant number of individuals do not follow that pattern. When an individual lists multiple fields for their Bachelor's Degrees, there is no mechanism in the survey to detect:

  1. Whether multiple fields belong to separate degrees, or whether a single degree has multiple major fields
  2. Which order the degrees were obtained in
So we can establish that the first degree listed is of statistical significance and that the second one listed is not, but it does not follow that this is the individual's chronologically first earned degree. It is simply the first one they chose to list, which could be determined by any number of criteria. It might be the one they earned first or the one they earned most recently. It might be the degree or field that the surveyed individual considers most important because it is the credential they generally advertise, or it may be due to some other criteria or chance. More transparency into this variable is one way the ACS could be improved.

Each of these degrees had a coefficient which was statistically significant at the 2% level and the B value, which is the coefficient, identifies the strength of the relationship between the independent variable and the rate of self-employment. If B is negative then an individual having that degree is associated with a reduced probability that they will be self-employed. I also added an asterisk in front of the degree name to help these stick out. If B is positive then having that degree is associated with an increased chance to be self-employed.

Here are the 15 most important degrees:

  1. General Agriculture
    1. _fod1p1, field code 1100, B = .091
  2. Agricultural Production and Management
    1. _fod1p2, field code 1101, B = .110
  3. Agricultural Economics
    1. _fod1p3, field code 1102, B = .126
  4. Animal Science
    1. _fod1p4, field code 1103, B = .093
  5. Plant Science and Agronomy
    1. _fod1p6, field code 1105, B = .086
  6. Architecture
    1. _fod1p12, field code 1401, B = .104
  7. *Educational Administration and Supervision
    1. _fod1p27, field code 2301, B = -.077
  8. Zoology
    1. _fod1p88, field code 3609, B = .071
  9. Fine Arts
    1. _fod1p139, field code 6000, B = .075
  10. Music
    1. _fod1p141, field code 6002, B = .085
  11. Visual and Performing Arts
    1. _fod1p142, field code 6003, B = .079
  12. Commercial Art and Graphic Design
    1. _fod1p143, field code 6004, B = .091
  13. Film Video and Photographic Arts
    1. _fod1p144, field code 6005, B = .131
  14. Studio Arts
    1. _fod1p146, field code 6007, B = .076
  15. Health and Medical Preparatory Programs
    1. _fod1p153, field code 6106, B = .103
Also notice which degrees are conspicuously absent: Computer Science has no statistical significance, which works against the theory that programmers will tend to be entrepreneurs due to the low barriers to entry in the IT industry.

It's also important to note that these results were obtained without correcting for income or wealth. Keep an eye out for a later article where I will revisit after including those variables in the regression.

If you are following along with the analysis.do file, notice that each variable in the variable series _fod1p* receives a label inside of STATA which identifies the recoded field of degree as per <a href="http://www2.census.gov/programs-surveys/acs/tech_docs/pums/data_dict/PUMSDataDict14.pdf">the code book beginning on P57.