Skip to content

nasheya/Human-Development-Analysis

Repository files navigation

Human Development Analysis

We did some data analysis with the human development index, and this is my branch of it. You can see the other's work at:

https://github.com/klcopp/human-development-data

https://github.com/vivekramanan/data-mining-human-development-index

My branch of it deals with analyzing the UN's human development index versus the social inclusion index and values from the World Values Survey representing general "human-ness".

The social inclusion index versus the HDI returned some pretty terrible results - got an R^2 value of 0.24.

For the World Values Survey, I picked the following values:

  • Mention some qualities that children should learn at home. (Binary values – mentioned = 1 or not mentioned = 2)

    • V15. Imagination
    • V16. Tolerance and respect for others
    • V20. Unselfishness
    • V22. Self-expression
  • Which of these organizations are you an active member, inactive member, or don’t belong? (Active member = 2, Inactive member = 1, Don’t belong = 0)

    • V27. Art, music, or educational organization
    • V32. Humanitarian or charitable organization
  • Describing some people, how much are they like you? (1 = very much like me and 6 = very much not like me)

    • V70. Important to this person to think of new ideas and be creative, to do things one’s own way
    • V74B. Important for this person to help the people nearby, to care for their well-being
  • How often, if at all, do you think about the meaning and purpose of life?

    • V143. (1 = Often, 2 = Sometimes, 3 = Rarely, 4 = Never)
  • I see myself as someone who… (1 = Disagree strongly and 5 = Agree Strongly, and 9 = Don’t know)

    • V160E. Has few artistic interests
    • V160J. Has an active imagination

All of these features together yielded a R^2 = 0.695.

After doing all this, I did the "leave one out method" to see which features had the most impact. A variable name means the effect on the R^2 and adjusted R^2 had by leaving that variable out. Here are my results from that:

  • All: r2 = 0.695, ar2 = 0.505
  • V15: r2 = 0.694, ar2 = 0.518
  • V16: r2 = 0.688, ar2 = 0.508
  • V20: r2 = 0.679, ar2 = 0.494
  • V22: r2 = 0.690, ar2 = 0.511
  • V27_1: r2 = 0.639, ar2 = 0.432
  • V32_1: r2 = 0.689, ar2 = 0.509
  • V27_2: r2 = 0.667, ar2 = 0.475
  • V32_2: r2 = 0.693, ar2 = 0.516
  • V70_1: r2 = 0.681, ar2 = 0.497
  • V70_2: r2 = 0.694, ar2 = 0.517
  • V70_3: r2 = 0.659, ar2 = 0.463
  • V74B_1: r2 = 0.680, ar2 = 0.497
  • V74B_2: r2 = 0.643, ar2 = 0.437
  • V74B_3: r2 = 0.649, ar2 = 0.446
  • V143_1: r2 = 0.682, ar2 = 0.499
  • V143_2: r2 = 0.692, ar2 = 0.515
  • V160E_1: r2 = 0.691, ar2 = 0.513
  • V160E_2: r2 = 0.695, ar2 = 0.519
  • V160J_5: r2 = 0.691, ar2 = 0.513
  • V160J_4: r2 = 0.695, ar2 = 0.520

In some cases, the general R^2 was kept and the adjusted R^2 went up.

About

Our final project for data mining involving work with the UN world development indexes dataset

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors