From a67da2ab11177d1a64c067214a9b54c13193fea1 Mon Sep 17 00:00:00 2001 From: scdavis50 Date: Wed, 23 Mar 2016 16:42:50 -0500 Subject: [PATCH 01/26] Transcript of my data science studies plan. --- transcripts/scott-davis-transcript.md | 130 ++++++++++++++++++++++++++ 1 file changed, 130 insertions(+) create mode 100644 transcripts/scott-davis-transcript.md diff --git a/transcripts/scott-davis-transcript.md b/transcripts/scott-davis-transcript.md new file mode 100644 index 00000000..452fa4c7 --- /dev/null +++ b/transcripts/scott-davis-transcript.md @@ -0,0 +1,130 @@ +

Scott Davis Transcript

+

Open Source Data Science Masters

+ +
I'm going to have some time for independent study this year so I plan to work through as much as possible. I work in the real estate industry and we have so much data that isn't used for meaningful analysis and the tools, though readily available, haven't caught up for the needs of real estate users. That's what I'm interested in working on. I use a lot of GIS and R, so my curriculum is tailored to follow [R](https://www.r-project.org/)/[Python](www.python.org) and [QGIS](www.qgis.org). I'm a bit of an open-source nut so I like learning much better this way. I'm looking for people to connect with, and possibly to work on projects.
+ +Want to collaborate? Get in touch: + * [linkedin](http://www.linkedin.com/in/scottcdavis); + * [twitter](http://www.twitter.com/scottdavisCRE); or + * [email](mailto:scott@tisonadevelopment.com) + + +

Open Source Curriculum

+

Base Introduction

+Data Science Introductions + - [ ] Intro to Data Science by UW / Coursera, online course + - [ ] Data Science Specialization by Johns Hopkins / Coursera + - [X] [Data Scientists Toolbox](https://www.coursera.org/account/accomplishments/certificate/UY4EBM46HL) + - [X] [R Programming](https://www.coursera.org/account/accomplishments/records/Va5vuEvGKyr7UyHEL) + - [X] [Getting and Cleaning Data](https://www.coursera.org/account/accomplishments/records/ENSGmvNfx24sANRW) + - [X] [Exploratory Data Analysis](https://www.coursera.org/account/accomplishments/records/2PPsRu2Us3sUehBQ) + - [ ] [Reproducible Research] (in progress) + - [ ] [Statistical Inference] + - [ ] [Regression Models] + - [ ] [Practical Machine Learning] (in progress) + - [ ] [Developing Data Products] + - [ ] [Data Science Capstone] +- [ ] [Data Science by Harvard](http://cs109.github.io/2015/) (online course) +- [ ] [Data Science with Open Source Tools](http://shop.oreilly.com/product/9780596802363.do) +- [50 Years of Data Science](http://pages.cs.wisc.edu/~anhai/courses/784-fall15/50YearsDataScience.pdf) +- [ ] [Datasmart](http://www.amazon.com/Data-Smart-Science-Transform-Information/dp/111866146X/ref=sr_1_1?s=books&ie=UTF8&qid=1458768727&sr=1-1&keywords=datasmart) - in Excel, but also works in LibreOffice and so much of business analytics is still in Excel. + + +

Mathematics/Statistics

+ - [ ] [Statistics for Spatial Data, Revised Edition](http://www.wiley.com/WileyCDA/WileyTitle/productCd-1119114616.html) + - [ ] [Statistics for Spatio-Temporal Data](http://www.wiley.com/WileyCDA/WileyTitle/productCd-EHEP002348.html) + - [ ] [Linear Algebra](http://www.amazon.com/Linear-Algebra-Dover-Books-Mathematics/dp/048663518X) + - [ ] Problem-Solving Heuristics: [How to Solve It](http://www.amazon.com/How-Solve-It-Mathematical-Princeton/dp/069111966X) + +

Computing

+R: + - [ ] [R in Action](https://www.manning.com/books/r-in-action-second-edition?a_bid=5c2b1e1d&a_aid=RiA2ed) + - [ ] [R Cookbook](http://shop.oreilly.com/product/9780596809164.do) + - [ ] [Forecasting: Principles and Practice](http://otexts.com/fpp/) + +R Libraries/Task Views + * [ProjectTemplate](http://projecttemplate.net/index.html) + * Spatial Data [CRAN Task View: Analysis of Spatial Data](https://cran.r-project.org/web/views/Spatial.html) + * Spatio-Temporal Data [CRAN Task View: Handling and Analyzing Spatio-Temporal Data](https://cran.r-project.org/web/views/SpatioTemporal.html) + * Optimization [CRAN Task View: Optimization and Mathematical Programming](https://cran.r-project.org/web/views/Optimization.html) + * Finance [CRAN Task View: Empirical Finance](https://cran.r-project.org/web/views/Finance.html) + +Python: + - [ ] [Dive Into Python](http://www.diveintopython.net/) + - [ ] [Google's Python Class](code.google.com/edu/languages/google-python-class/) + - [ ] [Python for Data Analysis](http://shop.oreilly.com/product/0636920023784.do) + - [ ] [Webscraping with Python](https://www.packtpub.com/big-data-and-business-intelligence/web-scraping-python) + +QGIS: + - [ ] [QGIS Tutorials and Tips](http://www.qgistutorials.com/en/) + - [ ] [Mastering QGIS](https://www.packtpub.com/application-development/mastering-qgis) + - [ ] [Building Mapping Applications with QGIS](https://www.packtpub.com/application-development/building-mapping-applications-qgis) + +MySQL: + - [ ] [Learn MySQL in One Video](https://www.youtube.com/watch?v=yPu6qV5byu4) + - [ ] [MySQL Workbench Starter](code.google.com/edu/languages/google-python-class/) + +Octave: + - [ ] [GNU Octave Beginners Guide](https://www.packtpub.com/big-data-and-business-intelligence/gnu-octave-beginners-guide) + - +PostGIS/PostGRESQL: + - [ ] [PostGIS Essentials](https://www.packtpub.com/big-data-and-business-intelligence/postgis-essentials) + - [ ] [PostGRESQL Tutorial](http://www.postgresqltutorial.com/) + 
- [ ] [PostgreSQL: Up and Running: A Practical Introduction to the Advanced Open Source Database](http://shop.oreilly.com/product/0636920032144.do) + +

Algorithms

+ - [ ] [Algorithms Design & Analysis](http://openclassroom.stanford.edu/MainFolder/CoursePage.php?course=IntroToAlgorithms) Stanford openclassroom + +

Distributed Computing Paradigms

+ - [ ] Intro to Hadoop and MapReduce by Cloudera and Udacity +*Note: I might swap the above course with an EdX course on Apache Spark and distributed computing* + +

Data Mining

+ - [ ] Mining Massive Data Sets, by Stanford and Coursera + - [ ] [Clean Data] (https://www.packtpub.com/big-data-and-business-intelligence/clean-data) + +

Machine Learning/Predictive Analytics - Foundational/Theoretical/Practical

+ - [ ] Machine Learning, by Ng Stanford and Coursera (NB this class requires a lot of higher level math) + - [ ] [An Introduction to Statistical Learning with Applications in R](http://www.r-bloggers.com/in-depth-introduction-to-machine-learning-in-15-hours-of-expert-videos/) (by the authors of The Elements of Statistical Learning at Stanford.) + - [ ] [Machine Learning with R](https://www.packtpub.com/big-data-and-business-intelligence/machine-learning-r-second-edition) + - [ ] [Building a Recommendation System in R](https://www.packtpub.com/big-data-and-business-intelligence/building-recommendation-system-r) + - [ ] [Mastering Predictive Analytics in R](https://www.packtpub.com/application-development/mastering-predictive-analytics-r) + - [ ] [Bootstrapping Machine Learning](http://www.louisdorard.com/machine-learning-book/) + - [ ] [Applied Predictive Modeling] (http://www.amazon.com/gp/product/1461468485?psc=1&redirect=true&ref_=oh_aui_detailpage_o08_s00) + +

Analysis

+ - [ ] [Practical Data Science Cookbook](http://www.diveintopython.net/) + - [ ] [R Data Analysis Cookbook](code.google.com/edu/languages/google-python-class/) + +

Spatial Analysis

+ - [ ] [An Introduction to R for Spatial Analysis and Mapping](http://www.edwardtufte.com/tufte/books_be) + - [ ] [Applied Spatial Data Analysis with R](http://www.springer.com/us/book/9781461476177) + +

Land Use/Transport/Gravity Modeling

+ - [ ] [Integrated Land Use and Transport Modelling: Decision Chains and Hierarchies](http://www.amazon.com/gp/product/0521022177?psc=1&redirect=true&ref_=oh_aui_detailpage_o03_s00) + - [ ] [Gravity and Spatial Interaction Models (Scientific Geography Series)](http://www.amazon.com/gp/product/0803925441?psc=1&redirect=true&ref_=oh_aui_detailpage_o06_s00) + - [ ] [TRANUS Model](http://www.tranus.com/tranus-english) + - [ ] [Urban Sim](https://pypi.python.org/pypi/urbansim) + - [ ] [Huff-tools Package in R] (http://rstudio-pubs-static.s3.amazonaws.com/42357_1e6fcc5bcfec439096eb86a106ebf22e.html) + - +

Data Design/Data Viz

+ - [ ] [Beautiful Evidence](http://www.edwardtufte.com/tufte/books_be) + - [ ] [Semiology of Graphics](http://www.amazon.com/Semiology-Graphics-Diagrams-Networks-Maps/dp/1589482611) + - [ ] [Visual Complexity Mapping Patterns of Information](hhttp://www.visualcomplexity.com/vc/book/) + - [ ] [The Visual Display of Quantitative Information](http://www.edwardtufte.com/tufte/books_vdqi) + - [ ] [Design for Information](http://isabelmeirelles.com/book-design-for-information/) + - [ ] [Design Elements: A Graphical Style Manual](http://www.amazon.com/Design-Elements-Graphic-Style-Manual/dp/1592532616) + - [ ] [Storytelling with Data] (http://www.amazon.com/gp/product/1119002257?psc=1&redirect=true&ref_=oh_aui_detailpage_o09_s00) + - [ ] [Mastering Python Data Visualization](https://www.packtpub.com/big-data-and-business-intelligence/mastering-python-data-visualization) + - [ ] [The Grammar of Graphics](https://www.packtpub.com/big-data-and-business-intelligence/mastering-python-data-visualization) + - [ ] [R Graphics Cookbook](http://shop.oreilly.com/product/9780596809164.do) + +

Relevant prior studies

+ - [X] MS in Community and Regional Planning, UT-Austin + - [X] BA in Liberal Arts, concentration in geography, UT-Austin + +

OpenSource Data Science Masters Capstone Project

+I'm interested in using data science approaches for better intelligence behind real estate decisions, specifically evaluating population growth, transactions and location decisions. I'd also like to evaluate statistical learning techniques to make better pricing decisions. Finally, I'd like to develop a model to optimize real estate portfolios. + +If you'd like to pair up for the capstone, [let me know](http://www.twitter.com/scottdavisCRE) + From 244c3c6718fcdbdc5fa1cdcccca76d4471c18ef3 Mon Sep 17 00:00:00 2001 From: Scott Davis Date: Wed, 23 Mar 2016 16:46:33 -0500 Subject: [PATCH 02/26] Updated transcript fixing the formatting. --- transcripts/scott-davis-transcript.md | 9 ++++----- 1 file changed, 4 insertions(+), 5 deletions(-) diff --git a/transcripts/scott-davis-transcript.md b/transcripts/scott-davis-transcript.md index 452fa4c7..70ec69f4 100644 --- a/transcripts/scott-davis-transcript.md +++ b/transcripts/scott-davis-transcript.md @@ -1,7 +1,7 @@ -

Scott Davis Transcript

-

Open Source Data Science Masters

+

Scott Davis Transcript

+

Open Source Data Science Masters

-
I'm going to have some time for independent study this year so I plan to work through as much as possible. I work in the real estate industry and we have so much data that isn't used for meaningful analysis and the tools, though readily available, haven't caught up for the needs of real estate users. That's what I'm interested in working on. I use a lot of GIS and R, so my curriculum is tailored to follow [R](https://www.r-project.org/)/[Python](www.python.org) and [QGIS](www.qgis.org). I'm a bit of an open-source nut so I like learning much better this way. I'm looking for people to connect with, and possibly to work on projects.
+I'm going to have some time for independent study this year so I plan to work through as much as possible. I work in the real estate industry and we have so much data that isn't used for meaningful analysis and the tools, though readily available, haven't caught up for the needs of real estate users. That's what I'm interested in working on. I use a lot of GIS and R, so my curriculum is tailored to follow [R](https://www.r-project.org/)/[Python](www.python.org) and [QGIS](www.qgis.org). I'm a bit of an open-source nut so I like learning much better this way. I'm looking for people to connect with, and possibly to work on projects. Want to collaborate? Get in touch: * [linkedin](http://www.linkedin.com/in/scottcdavis); @@ -29,7 +29,6 @@ Data Science Introductions - [50 Years of Data Science](http://pages.cs.wisc.edu/~anhai/courses/784-fall15/50YearsDataScience.pdf) - [ ] [Datasmart](http://www.amazon.com/Data-Smart-Science-Transform-Information/dp/111866146X/ref=sr_1_1?s=books&ie=UTF8&qid=1458768727&sr=1-1&keywords=datasmart) - in Excel, but also works in LibreOffice and so much of business analytics is still in Excel. -

Mathematics/Statistics

- [ ] [Statistics for Spatial Data, Revised Edition](http://www.wiley.com/WileyCDA/WileyTitle/productCd-1119114616.html) - [ ] [Statistics for Spatio-Temporal Data](http://www.wiley.com/WileyCDA/WileyTitle/productCd-EHEP002348.html) @@ -123,7 +122,7 @@ PostGIS/PostGRESQL: - [X] MS in Community and Regional Planning, UT-Austin - [X] BA in Liberal Arts, concentration in geography, UT-Austin -

OpenSource Data Science Masters Capstone Project

+

OpenSource Data Science Masters Capstone Project

I'm interested in using data science approaches for better intelligence behind real estate decisions, specifically evaluating population growth, transactions and location decisions. I'd also like to evaluate statistical learning techniques to make better pricing decisions. Finally, I'd like to develop a model to optimize real estate portfolios. If you'd like to pair up for the capstone, [let me know](http://www.twitter.com/scottdavisCRE) From bcbc0f4e8bb12b659036aba700abd0dfeff65904 Mon Sep 17 00:00:00 2001 From: Scott Davis Date: Wed, 23 Mar 2016 16:49:07 -0500 Subject: [PATCH 03/26] Update scott-davis-transcript.md --- transcripts/scott-davis-transcript.md | 14 +++++++------- 1 file changed, 7 insertions(+), 7 deletions(-) diff --git a/transcripts/scott-davis-transcript.md b/transcripts/scott-davis-transcript.md index 70ec69f4..31f89e44 100644 --- a/transcripts/scott-davis-transcript.md +++ b/transcripts/scott-davis-transcript.md @@ -1,5 +1,5 @@

Scott Davis Transcript

-

Open Source Data Science Masters

+

Open Source Data Science Masters

I'm going to have some time for independent study this year so I plan to work through as much as possible. I work in the real estate industry and we have so much data that isn't used for meaningful analysis and the tools, though readily available, haven't caught up for the needs of real estate users. That's what I'm interested in working on. I use a lot of GIS and R, so my curriculum is tailored to follow [R](https://www.r-project.org/)/[Python](www.python.org) and [QGIS](www.qgis.org). I'm a bit of an open-source nut so I like learning much better this way. I'm looking for people to connect with, and possibly to work on projects. @@ -29,7 +29,7 @@ Data Science Introductions - [50 Years of Data Science](http://pages.cs.wisc.edu/~anhai/courses/784-fall15/50YearsDataScience.pdf) - [ ] [Datasmart](http://www.amazon.com/Data-Smart-Science-Transform-Information/dp/111866146X/ref=sr_1_1?s=books&ie=UTF8&qid=1458768727&sr=1-1&keywords=datasmart) - in Excel, but also works in LibreOffice and so much of business analytics is still in Excel. -

Mathematics/Statistics

+

Mathematics/Statistics

- [ ] [Statistics for Spatial Data, Revised Edition](http://www.wiley.com/WileyCDA/WileyTitle/productCd-1119114616.html) - [ ] [Statistics for Spatio-Temporal Data](http://www.wiley.com/WileyCDA/WileyTitle/productCd-EHEP002348.html) - [ ] [Linear Algebra](http://www.amazon.com/Linear-Algebra-Dover-Books-Mathematics/dp/048663518X) @@ -71,18 +71,18 @@ PostGIS/PostGRESQL: - [ ] [PostGRESQL Tutorial](http://www.postgresqltutorial.com/) - [ ] [PostgreSQL: Up and Running: A Practical Introduction to the Advanced Open Source Database](http://shop.oreilly.com/product/0636920032144.do) -

Algorithms

+

Algorithms

- [ ] [Algorithms Design & Analysis](http://openclassroom.stanford.edu/MainFolder/CoursePage.php?course=IntroToAlgorithms) Stanford openclassroom

Distributed Computing Paradigms

- [ ] Intro to Hadoop and MapReduce by Cloudera and Udacity *Note: I might swap the above course with an EdX course on Apache Spark and distributed computing* -

Data Mining

+

Data Mining

- [ ] Mining Massive Data Sets, by Stanford and Coursera - [ ] [Clean Data] (https://www.packtpub.com/big-data-and-business-intelligence/clean-data) -

Machine Learning/Predictive Analytics - Foundational/Theoretical/Practical

+

Machine Learning/Predictive Analytics - Foundational/Theoretical/Practical

- [ ] Machine Learning, by Ng Stanford and Coursera (NB this class requires a lot of higher level math) - [ ] [An Introduction to Statistical Learning with Applications in R](http://www.r-bloggers.com/in-depth-introduction-to-machine-learning-in-15-hours-of-expert-videos/) (by the authors of The Elements of Statistical Learning at Stanford.) - [ ] [Machine Learning with R](https://www.packtpub.com/big-data-and-business-intelligence/machine-learning-r-second-edition) @@ -91,7 +91,7 @@ PostGIS/PostGRESQL: - [ ] [Bootstrapping Machine Learning](http://www.louisdorard.com/machine-learning-book/) - [ ] [Applied Predictive Modeling] (http://www.amazon.com/gp/product/1461468485?psc=1&redirect=true&ref_=oh_aui_detailpage_o08_s00) -

Analysis

+

Analysis

- [ ] [Practical Data Science Cookbook](http://www.diveintopython.net/) - [ ] [R Data Analysis Cookbook](code.google.com/edu/languages/google-python-class/) @@ -122,7 +122,7 @@ PostGIS/PostGRESQL: - [X] MS in Community and Regional Planning, UT-Austin - [X] BA in Liberal Arts, concentration in geography, UT-Austin -

OpenSource Data Science Masters Capstone Project

+

OpenSource Data Science Masters Capstone Project

I'm interested in using data science approaches for better intelligence behind real estate decisions, specifically evaluating population growth, transactions and location decisions. I'd also like to evaluate statistical learning techniques to make better pricing decisions. Finally, I'd like to develop a model to optimize real estate portfolios. If you'd like to pair up for the capstone, [let me know](http://www.twitter.com/scottdavisCRE) From f306f389f60dfccc9ecf5edc5163e4232f451091 Mon Sep 17 00:00:00 2001 From: Scott Davis Date: Wed, 23 Mar 2016 16:50:27 -0500 Subject: [PATCH 04/26] Update scott-davis-transcript.md --- transcripts/scott-davis-transcript.md | 4 ++-- 1 file changed, 2 insertions(+), 2 deletions(-) diff --git a/transcripts/scott-davis-transcript.md b/transcripts/scott-davis-transcript.md index 31f89e44..a99cd68d 100644 --- a/transcripts/scott-davis-transcript.md +++ b/transcripts/scott-davis-transcript.md @@ -65,7 +65,7 @@ MySQL: Octave: - [ ] [GNU Octave Beginners Guide](https://www.packtpub.com/big-data-and-business-intelligence/gnu-octave-beginners-guide) - - + PostGIS/PostGRESQL: - [ ] [PostGIS Essentials](https://www.packtpub.com/big-data-and-business-intelligence/postgis-essentials) - [ ] [PostGRESQL Tutorial](http://www.postgresqltutorial.com/) @@ -74,7 +74,7 @@ PostGIS/PostGRESQL:

Algorithms

- [ ] [Algorithms Design & Analysis](http://openclassroom.stanford.edu/MainFolder/CoursePage.php?course=IntroToAlgorithms) Stanford openclassroom -

Distributed Computing Paradigms

+

Distributed Computing Paradigms

- [ ] Intro to Hadoop and MapReduce by Cloudera and Udacity *Note: I might swap the above course with an EdX course on Apache Spark and distributed computing* From 4464351e4392af295c94723fae98a11b269c5ecc Mon Sep 17 00:00:00 2001 From: Scott Davis Date: Wed, 23 Mar 2016 16:51:28 -0500 Subject: [PATCH 05/26] Update scott-davis-transcript.md --- transcripts/scott-davis-transcript.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/transcripts/scott-davis-transcript.md b/transcripts/scott-davis-transcript.md index a99cd68d..e0d2002c 100644 --- a/transcripts/scott-davis-transcript.md +++ b/transcripts/scott-davis-transcript.md @@ -109,7 +109,7 @@ PostGIS/PostGRESQL:

Data Design/Data Viz

- [ ] [Beautiful Evidence](http://www.edwardtufte.com/tufte/books_be) - [ ] [Semiology of Graphics](http://www.amazon.com/Semiology-Graphics-Diagrams-Networks-Maps/dp/1589482611) - - [ ] [Visual Complexity Mapping Patterns of Information](hhttp://www.visualcomplexity.com/vc/book/) + - [ ] [Visual Complexity Mapping Patterns of Information](http://www.visualcomplexity.com/vc/book/) - [ ] [The Visual Display of Quantitative Information](http://www.edwardtufte.com/tufte/books_vdqi) - [ ] [Design for Information](http://isabelmeirelles.com/book-design-for-information/) - [ ] [Design Elements: A Graphical Style Manual](http://www.amazon.com/Design-Elements-Graphic-Style-Manual/dp/1592532616) From 7e8245feaf3d7aa4b5c640eacbadf092b9b8988a Mon Sep 17 00:00:00 2001 From: Scott Davis Date: Sat, 16 Apr 2016 20:46:52 -0600 Subject: [PATCH 06/26] Added a couple of resources, fixed tags --- scott-davis-transcript.md | 133 ++++++++++++++++++++++++++++++++++++++ 1 file changed, 133 insertions(+) create mode 100644 scott-davis-transcript.md diff --git a/scott-davis-transcript.md b/scott-davis-transcript.md new file mode 100644 index 00000000..e054318a --- /dev/null +++ b/scott-davis-transcript.md @@ -0,0 +1,133 @@ +

Scott Davis Transcript

+

Open Source Data Science Masters

+ +
I'm going to have some time for independent study this year so I plan to work through as much as possible. I work in the real estate industry and we have so much data that isn't used for meaningful analysis and the tools, though readily available, haven't caught up for the needs of real estate users. That's what I'm interested in working on. I use a lot of GIS and R, so my curriculum is tailored to follow [R](https://www.r-project.org/)/[Python](www.python.org) and [QGIS](www.qgis.org). I'm a bit of an open-source nut so I like learning much better this way. I'm looking for people to connect with, and possibly to work on projects.
+ +Want to collaborate? Get in touch: + * [linkedin](http://www.linkedin.com/in/scottcdavis); + * [twitter](http://www.twitter.com/scottdavisCRE); or + * [email](mailto:scott@tisonadevelopment.com) + + +

Open Source Curriculum

+

Base Introduction

+Data Science Introductions + - [ ] Intro to Data Science by UW / Coursera, online course + - [ ] Data Science Specialization by Johns Hopkins / Coursera + - [X] [Data Scientists Toolbox](https://www.coursera.org/account/accomplishments/certificate/UY4EBM46HL) + - [X] [R Programming](https://www.coursera.org/account/accomplishments/records/Va5vuEvGKyr7UyHEL) + - [X] [Getting and Cleaning Data](https://www.coursera.org/account/accomplishments/records/ENSGmvNfx24sANRW) + - [X] [Exploratory Data Analysis](https://www.coursera.org/account/accomplishments/records/2PPsRu2Us3sUehBQ) + - [X] [Reproducible Research] + - [ ] [Statistical Inference] (in progress) + - [ ] [Regression Models] (in progress) + - [X] [Practical Machine Learning] + - [ ] [Developing Data Products] + - [ ] [Data Science Capstone] +- [ ] [Data Science by Harvard](http://cs109.github.io/2015/) (online course) +- [ ] [Data Science with Open Source Tools](http://shop.oreilly.com/product/9780596802363.do) +- [50 Years of Data Science](http://pages.cs.wisc.edu/~anhai/courses/784-fall15/50YearsDataScience.pdf) +- [ ] [Datasmart](http://www.amazon.com/Data-Smart-Science-Transform-Information/dp/111866146X/ref=sr_1_1?s=books&ie=UTF8&qid=1458768727&sr=1-1&keywords=datasmart) - in Excel, but also works in LibreOffice and so much of business analytics is still in Excel. + + +

Mathematics/Statistics

+ - [ ] [Statistics for Spatial Data, Revised Edition](http://www.wiley.com/WileyCDA/WileyTitle/productCd-1119114616.html) + - [ ] [Statistics for Spatio-Temporal Data](http://www.wiley.com/WileyCDA/WileyTitle/productCd-EHEP002348.html) + - [ ] [Linear Algebra](http://www.amazon.com/Linear-Algebra-Dover-Books-Mathematics/dp/048663518X) + - [ ] Problem-Solving Heuristics: [How to Solve It](http://www.amazon.com/How-Solve-It-Mathematical-Princeton/dp/069111966X) + +

Computing

+R: + - [ ] [R in Action](https://www.manning.com/books/r-in-action-second-edition?a_bid=5c2b1e1d&a_aid=RiA2ed) + - [ ] [R Cookbook](http://shop.oreilly.com/product/9780596809164.do) + - [ ] [Forecasting: Principles and Practice](http://otexts.com/fpp/) + +R Libraries/Task Views + * [ProjectTemplate](http://projecttemplate.net/index.html) + * Spatial Data [CRAN Task View: Analysis of Spatial Data](https://cran.r-project.org/web/views/Spatial.html) + * Spatio-Temporal Data [CRAN Task View: Handling and Analyzing Spatio-Temporal Data](https://cran.r-project.org/web/views/SpatioTemporal.html) + * Optimization [CRAN Task View: Optimization and Mathematical Programming](https://cran.r-project.org/web/views/Optimization.html) + * Finance [CRAN Task View: Empirical Finance](https://cran.r-project.org/web/views/Finance.html) + +Python: + - [ ] [Dive Into Python](http://www.diveintopython.net/) + - [ ] [Google's Python Class](code.google.com/edu/languages/google-python-class/) + - [ ] [Python for Data Analysis](http://shop.oreilly.com/product/0636920023784.do) + - [ ] [Webscraping with Python](https://www.packtpub.com/big-data-and-business-intelligence/web-scraping-python) + +QGIS: + - [X] [QGIS Tutorials and Tips](http://www.qgistutorials.com/en/) + - [X] [Mastering QGIS](https://www.packtpub.com/application-development/mastering-qgis) + - [ ] [Building Mapping Applications with QGIS](https://www.packtpub.com/application-development/building-mapping-applications-qgis) + - [ ] [GIS Tutorial Workbook 1](https://esripress.esri.com/display/index.cfm?fuseaction=display&websiteID=232&moduleID=1) This is for ArcView, but you can work the examples in QGIS too + - [ ] [GIS Tutorial Workbook 2: Spatial Analysis](https://esripress.esri.com/display/index.cfm?fuseaction=display&websiteID=230&moduleID=0) This is for ArcView, but you can work the examples in QGIS too + - [ ] [QGIS Map Design](https://locatepress.com/qmd) I've just thumbed through this, but it's beautiful and belongs on 
any list of GIS books. + +MySQL: + - [ ] [Learn MySQL in One Video](https://www.youtube.com/watch?v=yPu6qV5byu4) + - [ ] [MySQL Workbench Starter](code.google.com/edu/languages/google-python-class/) + +Octave: + - [ ] [GNU Octave Beginners Guide](https://www.packtpub.com/big-data-and-business-intelligence/gnu-octave-beginners-guide) + - +PostGIS/PostGRESQL: + - [ ] [PostGIS Essentials](https://www.packtpub.com/big-data-and-business-intelligence/postgis-essentials) + - [ ] [PostGRESQL Tutorial](http://www.postgresqltutorial.com/) + - [ ] [PostgreSQL: Up and Running: A Practical Introduction to the Advanced Open Source Database](http://shop.oreilly.com/product/0636920032144.do) + +

Algorithms

+ - [ ] [Algorithms Design & Analysis](http://openclassroom.stanford.edu/MainFolder/CoursePage.php?course=IntroToAlgorithms) Stanford openclassroom + +

Distributed Computing Paradigms

+ - [ ] Intro to Hadoop and MapReduce by Cloudera and Udacity +*Note: I might swap the above course with an EdX course on Apache Spark and distributed computing* + +

Data Mining

+ - [ ] Mining Massive Data Sets, by Stanford and Coursera + - [ ] [Clean Data](https://www.packtpub.com/big-data-and-business-intelligence/clean-data) + +

Machine Learning/Predictive Analytics - Foundational/Theoretical/Practical

+ - [ ] Machine Learning, by Ng Stanford and Coursera (NB this class requires a lot of higher level math) + - [ ] [An Introduction to Statistical Learning with Applications in R](http://www.r-bloggers.com/in-depth-introduction-to-machine-learning-in-15-hours-of-expert-videos/) (by the authors of The Elements of Statistical Learning at Stanford.) + - [ ] [Machine Learning with R](https://www.packtpub.com/big-data-and-business-intelligence/machine-learning-r-second-edition) + - [ ] [Building a Recommendation System in R](https://www.packtpub.com/big-data-and-business-intelligence/building-recommendation-system-r) + - [ ] [Mastering Predictive Analytics in R](https://www.packtpub.com/application-development/mastering-predictive-analytics-r) + - [ ] [Bootstrapping Machine Learning](http://www.louisdorard.com/machine-learning-book/) + - [ ] [Applied Predictive Modeling](http://www.amazon.com/gp/product/1461468485?psc=1&redirect=true&ref_=oh_aui_detailpage_o08_s00) + +

Analysis

+ - [ ] [Practical Data Science Cookbook](http://www.diveintopython.net/) + - [ ] [R Data Analysis Cookbook](code.google.com/edu/languages/google-python-class/) + +

Spatial Analysis

+ - [ ] [An Introduction to R for Spatial Analysis and Mapping](https://us.sagepub.com/en-us/nam/an-introduction-to-r-for-spatial-analysis-and-mapping/book241031) + - [ ] [Applied Spatial Data Analysis with R](http://www.springer.com/us/book/9781461476177) + +

Land Use/Transport/Gravity Modeling

+ - [ ] [Integrated Land Use and Transport Modelling: Decision Chains and Hierarchies](http://www.amazon.com/gp/product/0521022177?psc=1&redirect=true&ref_=oh_aui_detailpage_o03_s00) + - [ ] [Gravity and Spatial Interaction Models (Scientific Geography Series)](http://www.amazon.com/gp/product/0803925441?psc=1&redirect=true&ref_=oh_aui_detailpage_o06_s00) + - [ ] [TRANUS Model](http://www.tranus.com/tranus-english) + - [ ] [Urban Sim](https://pypi.python.org/pypi/urbansim) + - [ ] [Huff-tools Package in R](http://rstudio-pubs-static.s3.amazonaws.com/42357_1e6fcc5bcfec439096eb86a106ebf22e.html) + - +

Data Design/Data Viz

+ - [ ] [Beautiful Evidence](http://www.edwardtufte.com/tufte/books_be) + - [ ] [Semiology of Graphics](http://www.amazon.com/Semiology-Graphics-Diagrams-Networks-Maps/dp/1589482611) + - [ ] [Visual Complexity Mapping Patterns of Information](hhttp://www.visualcomplexity.com/vc/book/) + - [ ] [The Visual Display of Quantitative Information](http://www.edwardtufte.com/tufte/books_vdqi) + - [ ] [Design for Information](http://isabelmeirelles.com/book-design-for-information/) + - [ ] [Design Elements: A Graphical Style Manual](http://www.amazon.com/Design-Elements-Graphic-Style-Manual/dp/1592532616) + - [ ] [Storytelling with Data](http://www.amazon.com/gp/product/1119002257?psc=1&redirect=true&ref_=oh_aui_detailpage_o09_s00) + - [ ] [Mastering Python Data Visualization](https://www.packtpub.com/big-data-and-business-intelligence/mastering-python-data-visualization) + - [ ] [The Grammar of Graphics](https://www.packtpub.com/big-data-and-business-intelligence/mastering-python-data-visualization) + - [ ] [R Graphics Cookbook](http://shop.oreilly.com/product/9780596809164.do) + +

Relevant prior studies

+ - [X] MS in Community and Regional Planning, UT-Austin + - [X] BA in Liberal Arts, concentration in geography, UT-Austin + +

OpenSource Data Science Masters Capstone Project

+I'm interested in using data science approaches for better intelligence behind real estate decisions, specifically evaluating population growth, transactions and location decisions. I'd also like to evaluate statistical learning techniques to make better pricing decisions. Finally, I'd like to develop a model to optimize real estate portfolios. + +If you'd like to pair up for the capstone, [let me know](http://www.twitter.com/scottdavisCRE) + From 70ca8069450152b45d44bd46a54362d8f82d6439 Mon Sep 17 00:00:00 2001 From: Scott Davis Date: Mon, 18 Apr 2016 18:17:45 -0500 Subject: [PATCH 07/26] Updates to transcript --- transcripts/scott-davis-transcript.md | 12 ++++++------ 1 file changed, 6 insertions(+), 6 deletions(-) diff --git a/transcripts/scott-davis-transcript.md b/transcripts/scott-davis-transcript.md index e0d2002c..34ca55a9 100644 --- a/transcripts/scott-davis-transcript.md +++ b/transcripts/scott-davis-transcript.md @@ -18,22 +18,22 @@ Data Science Introductions - [X] [R Programming](https://www.coursera.org/account/accomplishments/records/Va5vuEvGKyr7UyHEL) - [X] [Getting and Cleaning Data](https://www.coursera.org/account/accomplishments/records/ENSGmvNfx24sANRW) - [X] [Exploratory Data Analysis](https://www.coursera.org/account/accomplishments/records/2PPsRu2Us3sUehBQ) - - [ ] [Reproducible Research] (in progress) + - [X] [Reproducible Research] - [ ] [Statistical Inference] - [ ] [Regression Models] - - [ ] [Practical Machine Learning] (in progress) + - [X] [Practical Machine Learning] - [ ] [Developing Data Products] - [ ] [Data Science Capstone] - [ ] [Data Science by Harvard](http://cs109.github.io/2015/) (online course) - [ ] [Data Science with Open Source Tools](http://shop.oreilly.com/product/9780596802363.do) -- [50 Years of Data Science](http://pages.cs.wisc.edu/~anhai/courses/784-fall15/50YearsDataScience.pdf) +- [ ] [50 Years of Data Science](http://pages.cs.wisc.edu/~anhai/courses/784-fall15/50YearsDataScience.pdf) - [ ] 
[Datasmart](http://www.amazon.com/Data-Smart-Science-Transform-Information/dp/111866146X/ref=sr_1_1?s=books&ie=UTF8&qid=1458768727&sr=1-1&keywords=datasmart) - in Excel, but also works in LibreOffice and so much of business analytics is still in Excel.

Mathematics/Statistics

- [ ] [Statistics for Spatial Data, Revised Edition](http://www.wiley.com/WileyCDA/WileyTitle/productCd-1119114616.html) - [ ] [Statistics for Spatio-Temporal Data](http://www.wiley.com/WileyCDA/WileyTitle/productCd-EHEP002348.html) - [ ] [Linear Algebra](http://www.amazon.com/Linear-Algebra-Dover-Books-Mathematics/dp/048663518X) - - [ ] Problem-Solving Heuristics: [How to Solve It](http://www.amazon.com/How-Solve-It-Mathematical-Princeton/dp/069111966X) + - [X] Problem-Solving Heuristics: [How to Solve It](http://www.amazon.com/How-Solve-It-Mathematical-Princeton/dp/069111966X)

Computing

R: @@ -55,8 +55,8 @@ Python: - [ ] [Webscraping with Python](https://www.packtpub.com/big-data-and-business-intelligence/web-scraping-python) QGIS: - - [ ] [QGIS Tutorials and Tips](http://www.qgistutorials.com/en/) - - [ ] [Mastering QGIS](https://www.packtpub.com/application-development/mastering-qgis) + - [X] [QGIS Tutorials and Tips](http://www.qgistutorials.com/en/) + - [X] [Mastering QGIS](https://www.packtpub.com/application-development/mastering-qgis) - [ ] [Building Mapping Applications with QGIS](https://www.packtpub.com/application-development/building-mapping-applications-qgis) MySQL: From d22bab895c23096bcf15e8f171386e4dcc8d7981 Mon Sep 17 00:00:00 2001 From: Scott Davis Date: Mon, 18 Apr 2016 18:39:45 -0500 Subject: [PATCH 08/26] additional updates, fixed some formatting --- transcripts/scott-davis-transcript.md | 7 +++++-- 1 file changed, 5 insertions(+), 2 deletions(-) diff --git a/transcripts/scott-davis-transcript.md b/transcripts/scott-davis-transcript.md index 34ca55a9..35a0e0b9 100644 --- a/transcripts/scott-davis-transcript.md +++ b/transcripts/scott-davis-transcript.md @@ -27,12 +27,12 @@ Data Science Introductions - [ ] [Data Science by Harvard](http://cs109.github.io/2015/) (online course) - [ ] [Data Science with Open Source Tools](http://shop.oreilly.com/product/9780596802363.do) - [ ] [50 Years of Data Science](http://pages.cs.wisc.edu/~anhai/courses/784-fall15/50YearsDataScience.pdf) -- [ ] [Datasmart](http://www.amazon.com/Data-Smart-Science-Transform-Information/dp/111866146X/ref=sr_1_1?s=books&ie=UTF8&qid=1458768727&sr=1-1&keywords=datasmart) - in Excel, but also works in LibreOffice and so much of business analytics is still in Excel. +- [ ] [Datasmart](http://www.amazon.com/Data-Smart-Science-Transform-Information/dp/111866146X/ref=sr_1_1?s=books&ie=UTF8&qid=1458768727&sr=1-1&keywords=datasmart) - This book is a thorough review of using Excel for data science tools. 
Every aspiring data scientist should work through this book because (1) you'll learn a lot because Excel makes you do every step and (2) you'll realize you need to learn R or python or some other way to do these analyses.

Mathematics/Statistics

- [ ] [Statistics for Spatial Data, Revised Edition](http://www.wiley.com/WileyCDA/WileyTitle/productCd-1119114616.html) - [ ] [Statistics for Spatio-Temporal Data](http://www.wiley.com/WileyCDA/WileyTitle/productCd-EHEP002348.html) - - [ ] [Linear Algebra](http://www.amazon.com/Linear-Algebra-Dover-Books-Mathematics/dp/048663518X) + - [ ] [Linear Programming: An Introduction With Applications (Second Edition)](http://www.amazon.com/Linear-Programming-Introduction-Applications-Edition/dp/1463543670?ie=UTF8&psc=1&redirect=true&ref_=oh_aui_detailpage_o01_s00) - [X] Problem-Solving Heuristics: [How to Solve It](http://www.amazon.com/How-Solve-It-Mathematical-Princeton/dp/069111966X)

Computing

@@ -58,6 +58,9 @@ QGIS: - [X] [QGIS Tutorials and Tips](http://www.qgistutorials.com/en/) - [X] [Mastering QGIS](https://www.packtpub.com/application-development/mastering-qgis) - [ ] [Building Mapping Applications with QGIS](https://www.packtpub.com/application-development/building-mapping-applications-qgis) + - [X] [GIS Tutorial Workbook 1](https://esripress.esri.com/display/index.cfm?fuseaction=display&websiteID=232&moduleID=1) This is for ArcView, but you can work the examples in QGIS too + - [ ] [GIS Tutorial Workbook 2: Spatial Analysis](https://esripress.esri.com/display/index.cfm?fuseaction=display&websiteID=230&moduleID=0) This is for ArcView, but you can work the examples in QGIS too + - [ ] [QGIS Map Design](https://locatepress.com/qmd) I've just thumbed through this, but it's beautiful and belongs on any list of GIS books. MySQL: - [ ] [Learn MySQL in One Video](https://www.youtube.com/watch?v=yPu6qV5byu4) From de7679edd89294354ccc2f4aec30ae73404e4dde Mon Sep 17 00:00:00 2001 From: Scott Davis Date: Mon, 18 Apr 2016 18:41:31 -0500 Subject: [PATCH 09/26] updates --- transcripts/scott-davis-transcript.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/transcripts/scott-davis-transcript.md b/transcripts/scott-davis-transcript.md index 35a0e0b9..157b4285 100644 --- a/transcripts/scott-davis-transcript.md +++ b/transcripts/scott-davis-transcript.md @@ -1,7 +1,7 @@

Scott Davis Transcript

Open Source Data Science Masters

-I'm going to have some time for indepedent study this year so I plan to work through as much as possible. I work in the real estate industry and we have so much data that isn't used for meaningful analysis and the tools, though readily available, haven't caught up for the needs of real estate users. That's what I'm interested in working on. I use a lot of GIS and R, so my curriculum is tailored to follow [R](https://www.r-project.org/)/[Python](www.python.org) and [QGIS](www.qgis.org). I'm a bit of an open-source nut so I like learning much better this way. I'm looking for people to connect with, and possibly to work on projects. +I'm going to have some time for indepedent study this year so I plan to work through as much as possible. I work in the real estate industry and we have so much data that isn't used for meaningful analysis and the tools, though readily available, haven't caught up for the needs of real estate users. That's what I'm interested in working on. I use a lot of GIS and R, so my curriculum is tailored to follow [R](https://www.r-project.org/)/[Python](www.python.org) and [QGIS](www.qgis.org). I'm a bit of an open-source nut so I like learning much better this way. I'm looking for people to connect with, and possibly to work on projects. Also, maybe not technically purely open source as I've used a lot of books - which I've linked to here. Want to collaborate? 
Get in touch: * [linkedin](http://www.linkedin.com/in/scottcdavis); From 386dce9f2761d09d5697b697aa82e0a2074595dd Mon Sep 17 00:00:00 2001 From: Scott Davis Date: Tue, 19 Apr 2016 08:44:11 -0500 Subject: [PATCH 10/26] corrected a link --- transcripts/scott-davis-transcript.md | 8 ++++---- 1 file changed, 4 insertions(+), 4 deletions(-) diff --git a/transcripts/scott-davis-transcript.md b/transcripts/scott-davis-transcript.md index 157b4285..854444e0 100644 --- a/transcripts/scott-davis-transcript.md +++ b/transcripts/scott-davis-transcript.md @@ -19,14 +19,14 @@ Data Science Introductions - [X] [Getting and Cleaning Data](https://www.coursera.org/account/accomplishments/records/ENSGmvNfx24sANRW) - [X] [Exploratory Data Analysis](https://www.coursera.org/account/accomplishments/records/2PPsRu2Us3sUehBQ) - [X] [Reproducible Research] - - [ ] [Statistical Inference] - - [ ] [Regression Models] + - [ ] [Statistical Inference] in progress + - [ ] [Regression Models] in progress - [X] [Practical Machine Learning] - [ ] [Developing Data Products] - [ ] [Data Science Capstone] - [ ] [Data Science by Harvard](http://cs109.github.io/2015/) (online course) - [ ] [Data Science with Open Source Tools](http://shop.oreilly.com/product/9780596802363.do) -- [ ] [50 Years of Data Science](http://pages.cs.wisc.edu/~anhai/courses/784-fall15/50YearsDataScience.pdf) +- [X] [50 Years of Data Science](http://pages.cs.wisc.edu/~anhai/courses/784-fall15/50YearsDataScience.pdf) - [ ] [Datasmart](http://www.amazon.com/Data-Smart-Science-Transform-Information/dp/111866146X/ref=sr_1_1?s=books&ie=UTF8&qid=1458768727&sr=1-1&keywords=datasmart) - This book is a thorough review of using Excel for data science tools. Every aspiring data scientist should work through this book because (1) you'll learn a lot because Excel makes you do every step and (2) you'll realize you need to learn R or python or some other way to do these analyses.

Mathematics/Statistics

@@ -118,7 +118,7 @@ PostGIS/PostGRESQL: - [ ] [Design Elements: A Graphical Style Manual](http://www.amazon.com/Design-Elements-Graphic-Style-Manual/dp/1592532616) - [ ] [Storytelling with Data] (http://www.amazon.com/gp/product/1119002257?psc=1&redirect=true&ref_=oh_aui_detailpage_o09_s00) - [ ] [Mastering Python Data Visualization](https://www.packtpub.com/big-data-and-business-intelligence/mastering-python-data-visualization) - - [ ] [The Grammar of Graphics](https://www.packtpub.com/big-data-and-business-intelligence/mastering-python-data-visualization) + - [ ] [The Grammar of Graphics](http://www.springer.com/us/book/9780387245447) - [ ] [R Graphics Cookbook](http://shop.oreilly.com/product/9780596809164.do)

Relevant prior studies

From aef621d45ef3a5305e180ac4d16df40305ecb82b Mon Sep 17 00:00:00 2001 From: Scott Davis Date: Fri, 22 Apr 2016 18:15:09 -0500 Subject: [PATCH 11/26] Updated with new algorithm certificaiton --- transcripts/scott-davis-transcript.md | 15 ++++++++++----- 1 file changed, 10 insertions(+), 5 deletions(-) diff --git a/transcripts/scott-davis-transcript.md b/transcripts/scott-davis-transcript.md index 854444e0..218c0a7e 100644 --- a/transcripts/scott-davis-transcript.md +++ b/transcripts/scott-davis-transcript.md @@ -18,16 +18,15 @@ Data Science Introductions - [X] [R Programming](https://www.coursera.org/account/accomplishments/records/Va5vuEvGKyr7UyHEL) - [X] [Getting and Cleaning Data](https://www.coursera.org/account/accomplishments/records/ENSGmvNfx24sANRW) - [X] [Exploratory Data Analysis](https://www.coursera.org/account/accomplishments/records/2PPsRu2Us3sUehBQ) - - [X] [Reproducible Research] + - [X] [Reproducible Research](https://www.coursera.org/account/accomplishments/certificate/YRP8NLFYPCV9) - [ ] [Statistical Inference] in progress - [ ] [Regression Models] in progress - - [X] [Practical Machine Learning] + - [X] [Practical Machine Learning](https://www.coursera.org/account/accomplishments/certificate/AJJS85KTU6GZ) - [ ] [Developing Data Products] - [ ] [Data Science Capstone] -- [ ] [Data Science by Harvard](http://cs109.github.io/2015/) (online course) - [ ] [Data Science with Open Source Tools](http://shop.oreilly.com/product/9780596802363.do) - [X] [50 Years of Data Science](http://pages.cs.wisc.edu/~anhai/courses/784-fall15/50YearsDataScience.pdf) -- [ ] [Datasmart](http://www.amazon.com/Data-Smart-Science-Transform-Information/dp/111866146X/ref=sr_1_1?s=books&ie=UTF8&qid=1458768727&sr=1-1&keywords=datasmart) - This book is a thorough review of using Excel for data science tools. 
Every aspiring data scientist should work through this book because (1) you'll learn a lot because Excel makes you do every step and (2) you'll realize you need to learn R or python or some other way to do these analyses. +- [X] [Datasmart](http://www.amazon.com/Data-Smart-Science-Transform-Information/dp/111866146X/ref=sr_1_1?s=books&ie=UTF8&qid=1458768727&sr=1-1&keywords=datasmart) - This book is a thorough review of using Excel for data science tools. Every aspiring data scientist should work through this book because (1) you'll learn a lot because Excel makes you do every step and (2) you'll realize you need to learn R or python or some other way to do these analyses.

Mathematics/Statistics

- [ ] [Statistics for Spatial Data, Revised Edition](http://www.wiley.com/WileyCDA/WileyTitle/productCd-1119114616.html) @@ -75,7 +74,13 @@ PostGIS/PostGRESQL: - [ ] [PostgreSQL: Up and Running: A Practical Introduction to the Advanced Open Source Database](http://shop.oreilly.com/product/0636920032144.do)

Algorithms

- - [ ] [Algorithms Design & Analysis](http://openclassroom.stanford.edu/MainFolder/CoursePage.php?course=IntroToAlgorithms) Stanford openclassroom +- [ ] Data Structures and Algorithms by UCSD / Coursera + - [ ] [Algorithmic Toolbox] in progress + - [ ] [Data Structures] + - [ ] [Algorithms on Graphs and Trees] + - [ ] [Algorithms on Strings] + - [ ] [Advanced Algorithms and Complexity] + - [ ] [Assembling Genomes and Finding Disease-Causing Mutations]

Distributed Computing Paradigms

- [ ] Intro to Hadoop and MapReduce by Cloudera and Udacity From 5590047bcfe3f68e1221f5826e192d33ff9cdc6a Mon Sep 17 00:00:00 2001 From: Scott Davis Date: Thu, 28 Apr 2016 22:35:20 -0500 Subject: [PATCH 12/26] Update with edx classes --- transcripts/scott-davis-transcript.md | 22 ++++++++++++---------- 1 file changed, 12 insertions(+), 10 deletions(-) diff --git a/transcripts/scott-davis-transcript.md b/transcripts/scott-davis-transcript.md index 218c0a7e..2614e5ed 100644 --- a/transcripts/scott-davis-transcript.md +++ b/transcripts/scott-davis-transcript.md @@ -12,7 +12,10 @@ Want to collaborate? Get in touch:

Open Source Curriculum

Base Introduction

Data Science Introductions - - [ ] Intro to Data Science by UW / Coursera, online course +- [ ] [Data Science with Open Source Tools](http://shop.oreilly.com/product/9780596802363.do) +- [ ] [Data Science from Scratch](http://shop.oreilly.com/product/0636920033400.do) +- [X] [50 Years of Data Science](http://pages.cs.wisc.edu/~anhai/courses/784-fall15/50YearsDataScience.pdf) +- [X] [Datasmart](http://www.amazon.com/Data-Smart-Science-Transform-Information/dp/111866146X/ref=sr_1_1?s=books&ie=UTF8&qid=1458768727&sr=1-1&keywords=datasmart) - This book is a thorough review of using Excel for data science tools. Every aspiring data scientist should work through this book because (1) you'll learn a lot because Excel makes you do every step and (2) you'll realize you need to learn R or python or some other way to do these analyses. - [ ] Data Science Specialization by Johns Hopkins / Coursera - [X] [Data Scientists Toolbox](https://www.coursera.org/account/accomplishments/certificate/UY4EBM46HL) - [X] [R Programming](https://www.coursera.org/account/accomplishments/records/Va5vuEvGKyr7UyHEL) @@ -24,9 +27,7 @@ Data Science Introductions - [X] [Practical Machine Learning](https://www.coursera.org/account/accomplishments/certificate/AJJS85KTU6GZ) - [ ] [Developing Data Products] - [ ] [Data Science Capstone] -- [ ] [Data Science with Open Source Tools](http://shop.oreilly.com/product/9780596802363.do) -- [X] [50 Years of Data Science](http://pages.cs.wisc.edu/~anhai/courses/784-fall15/50YearsDataScience.pdf) -- [X] [Datasmart](http://www.amazon.com/Data-Smart-Science-Transform-Information/dp/111866146X/ref=sr_1_1?s=books&ie=UTF8&qid=1458768727&sr=1-1&keywords=datasmart) - This book is a thorough review of using Excel for data science tools. Every aspiring data scientist should work through this book because (1) you'll learn a lot because Excel makes you do every step and (2) you'll realize you need to learn R or python or some other way to do these analyses. +

Mathematics/Statistics

- [ ] [Statistics for Spatial Data, Revised Edition](http://www.wiley.com/WileyCDA/WileyTitle/productCd-1119114616.html) @@ -50,6 +51,7 @@ R Libraries/Task Views Python: - [ ] [Dive Into Python](http://www.diveintopython.net/) - [ ] [Google's Python Class](code.google.com/edu/languages/google-python-class/) + - [ ] [Introduction to Python for Data Science - edx](https://courses.edx.org/courses/course-v1:Microsoft+DAT208x+2T2016/info) - [ ] [Python for Data Analysis](http://shop.oreilly.com/product/0636920023784.do) - [ ] [Webscraping with Python](https://www.packtpub.com/big-data-and-business-intelligence/web-scraping-python) @@ -82,22 +84,22 @@ PostGIS/PostGRESQL: - [ ] [Advanced Algorithms and Complexity] - [ ] [Assembling Genomes and Finding Disease-Causing Mutations] -

Distributed Computing Paradigms

- - [ ] Intro to Hadoop and MapReduce by Cloudera and Udacity -*Note: I might swap the above course with an EdX course on Apache Spark and distributed computing* +

Distributed Computing

+ - [ ] Introduction to Spark, edx + - [ ] Machine Learning with Spark, edx

Data Mining

- [ ] Mining Massive Data Sets, by Stanford and Coursera - [ ] [Clean Data] (https://www.packtpub.com/big-data-and-business-intelligence/clean-data)

Machine Learning/Predictive Analytics - Foundational/Theoretical/Practical

- - [ ] Machine Learning, by Ng Stanford and Coursera (NB this class requires a lot of higher level math) + - [ ] [Statistical Learning with Trevor Hastie and Robert Tibshirani](http://www.r-bloggers.com/in-depth-introduction-to-machine-learning-in-15-hours-of-expert-videos/) - [ ] [An Introduction to Statistical Learning with Applications in R](http://www.r-bloggers.com/in-depth-introduction-to-machine-learning-in-15-hours-of-expert-videos/) (by the authors of The Elements of Statistical Learning at Stanford.) - [ ] [Machine Learning with R](https://www.packtpub.com/big-data-and-business-intelligence/machine-learning-r-second-edition) - [ ] [Building a Recommendation System in R](https://www.packtpub.com/big-data-and-business-intelligence/building-recommendation-system-r) - [ ] [Mastering Predictive Analytics in R](https://www.packtpub.com/application-development/mastering-predictive-analytics-r) - [ ] [Bootstrapping Machine Learning](http://www.louisdorard.com/machine-learning-book/) - - [ ] [Applied Predictive Modeling] (http://www.amazon.com/gp/product/1461468485?psc=1&redirect=true&ref_=oh_aui_detailpage_o08_s00) + - [ ] [Applied Predictive Modeling](http://www.amazon.com/gp/product/1461468485?psc=1&redirect=true&ref_=oh_aui_detailpage_o08_s00)

Analysis

- [ ] [Practical Data Science Cookbook](http://www.diveintopython.net/) @@ -112,7 +114,7 @@ PostGIS/PostGRESQL: - [ ] [Gravity and Spatial Interaction Models (Scientific Geography Series)](http://www.amazon.com/gp/product/0803925441?psc=1&redirect=true&ref_=oh_aui_detailpage_o06_s00) - [ ] [TRANUS Model](http://www.tranus.com/tranus-english) - [ ] [Urban Sim](https://pypi.python.org/pypi/urbansim) - - [ ] [Huff-tools Package in R] (http://rstudio-pubs-static.s3.amazonaws.com/42357_1e6fcc5bcfec439096eb86a106ebf22e.html) + - [ ] [Huff-tools Package in R](http://rstudio-pubs-static.s3.amazonaws.com/42357_1e6fcc5bcfec439096eb86a106ebf22e.html) -

Data Design/Data Viz

- [ ] [Beautiful Evidence](http://www.edwardtufte.com/tufte/books_be) From 7be0b77b0de3fafe7905fbc8311c39cb6613e0d0 Mon Sep 17 00:00:00 2001 From: Scott Davis Date: Mon, 2 May 2016 16:08:09 -0500 Subject: [PATCH 13/26] Updated with websites, along with completions to date --- transcripts/scott-davis-transcript.md | 13 +++++++------ 1 file changed, 7 insertions(+), 6 deletions(-) diff --git a/transcripts/scott-davis-transcript.md b/transcripts/scott-davis-transcript.md index 2614e5ed..8baa8b60 100644 --- a/transcripts/scott-davis-transcript.md +++ b/transcripts/scott-davis-transcript.md @@ -51,7 +51,7 @@ R Libraries/Task Views Python: - [ ] [Dive Into Python](http://www.diveintopython.net/) - [ ] [Google's Python Class](code.google.com/edu/languages/google-python-class/) - - [ ] [Introduction to Python for Data Science - edx](https://courses.edx.org/courses/course-v1:Microsoft+DAT208x+2T2016/info) + - [X] [Introduction to Python for Data Science - edx](https://courses.edx.org/courses/course-v1:Microsoft+DAT208x+2T2016/info) - [ ] [Python for Data Analysis](http://shop.oreilly.com/product/0636920023784.do) - [ ] [Webscraping with Python](https://www.packtpub.com/big-data-and-business-intelligence/web-scraping-python) @@ -98,16 +98,17 @@ PostGIS/PostGRESQL: - [ ] [Machine Learning with R](https://www.packtpub.com/big-data-and-business-intelligence/machine-learning-r-second-edition) - [ ] [Building a Recommendation System in R](https://www.packtpub.com/big-data-and-business-intelligence/building-recommendation-system-r) - [ ] [Mastering Predictive Analytics in R](https://www.packtpub.com/application-development/mastering-predictive-analytics-r) - - [ ] [Bootstrapping Machine Learning](http://www.louisdorard.com/machine-learning-book/) + - [X] [Bootstrapping Machine Learning](http://www.louisdorard.com/machine-learning-book/) - [ ] [Applied Predictive Modeling](http://www.amazon.com/gp/product/1461468485?psc=1&redirect=true&ref_=oh_aui_detailpage_o08_s00)

Analysis

- - [ ] [Practical Data Science Cookbook](http://www.diveintopython.net/) - - [ ] [R Data Analysis Cookbook](code.google.com/edu/languages/google-python-class/) + - [ ] [Practical Data Science Cookbook](https://www.packtpub.com/big-data-and-business-intelligence/practical-data-science-cookbook) + - [ ] [R Data Analysis Cookbook](http://www.amazon.com/Data-Analysis-Cookbook-Recipes-Deliver/dp/1783989068)

Spatial Analysis

- - [ ] [An Introduction to R for Spatial Analysis and Mapping](http://www.edwardtufte.com/tufte/books_be) + - [ ] [An Introduction to R for Spatial Analysis and Mapping](https://uk.sagepub.com/en-gb/eur/an-introduction-to-r-for-spatial-analysis-and-mapping/book241031) - [ ] [Applied Spatial Data Analysis with R](http://www.springer.com/us/book/9781461476177) + - [ ] [Geospatial Analysis - 5th Edition, 2015 - de Smith, Goodchild, Longley](http://www.spatialanalysisonline.com/HTML/index.html)

Land Use/Transport/Gravity Modeling

- [ ] [Integrated Land Use and Transport Modelling: Decision Chains and Hierarchies](http://www.amazon.com/gp/product/0521022177?psc=1&redirect=true&ref_=oh_aui_detailpage_o03_s00) @@ -123,7 +124,7 @@ PostGIS/PostGRESQL: - [ ] [The Visual Display of Quantitative Information](http://www.edwardtufte.com/tufte/books_vdqi) - [ ] [Design for Information](http://isabelmeirelles.com/book-design-for-information/) - [ ] [Design Elements: A Graphical Style Manual](http://www.amazon.com/Design-Elements-Graphic-Style-Manual/dp/1592532616) - - [ ] [Storytelling with Data] (http://www.amazon.com/gp/product/1119002257?psc=1&redirect=true&ref_=oh_aui_detailpage_o09_s00) + - [X] [Storytelling with Data](http://www.amazon.com/gp/product/1119002257?psc=1&redirect=true&ref_=oh_aui_detailpage_o09_s00) - [ ] [Mastering Python Data Visualization](https://www.packtpub.com/big-data-and-business-intelligence/mastering-python-data-visualization) - [ ] [The Grammar of Graphics](http://www.springer.com/us/book/9780387245447) - [ ] [R Graphics Cookbook](http://shop.oreilly.com/product/9780596809164.do) From ee559df5eadd2cede68c7c1463360170ca527eb9 Mon Sep 17 00:00:00 2001 From: Scott Davis Date: Sat, 7 May 2016 08:22:08 -0500 Subject: [PATCH 14/26] Updated with course completions --- transcripts/scott-davis-transcript.md | 6 +++--- 1 file changed, 3 insertions(+), 3 deletions(-) diff --git a/transcripts/scott-davis-transcript.md b/transcripts/scott-davis-transcript.md index 8baa8b60..27fc7a96 100644 --- a/transcripts/scott-davis-transcript.md +++ b/transcripts/scott-davis-transcript.md @@ -22,10 +22,10 @@ Data Science Introductions - [X] [Getting and Cleaning Data](https://www.coursera.org/account/accomplishments/records/ENSGmvNfx24sANRW) - [X] [Exploratory Data Analysis](https://www.coursera.org/account/accomplishments/records/2PPsRu2Us3sUehBQ) - [X] [Reproducible Research](https://www.coursera.org/account/accomplishments/certificate/YRP8NLFYPCV9) - - [ ] [Statistical Inference] in progress - 
- [ ] [Regression Models] in progress + - [X] [Statistical Inference](https://www.coursera.org/account/accomplishments/records/9733QCP94GEF) + - [X] [Regression Models](https://www.coursera.org/account/accomplishments/records/PP8SKS7CPSDC) - [X] [Practical Machine Learning](https://www.coursera.org/account/accomplishments/certificate/AJJS85KTU6GZ) - - [ ] [Developing Data Products] + - [ ] [Developing Data Products] in progress - [ ] [Data Science Capstone] From 2c84ef2ef5b8238097a8f91ffbfd1054d9693a41 Mon Sep 17 00:00:00 2001 From: Scott Davis Date: Wed, 11 May 2016 19:33:43 -0500 Subject: [PATCH 15/26] Updated completions --- transcripts/scott-davis-transcript.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/transcripts/scott-davis-transcript.md b/transcripts/scott-davis-transcript.md index 27fc7a96..d35e270c 100644 --- a/transcripts/scott-davis-transcript.md +++ b/transcripts/scott-davis-transcript.md @@ -50,7 +50,7 @@ R Libraries/Task Views Python: - [ ] [Dive Into Python](http://www.diveintopython.net/) - - [ ] [Google's Python Class](code.google.com/edu/languages/google-python-class/) + - [X] [Google's Python Class](code.google.com/edu/languages/google-python-class/) - [X] [Introduction to Python for Data Science - edx](https://courses.edx.org/courses/course-v1:Microsoft+DAT208x+2T2016/info) - [ ] [Python for Data Analysis](http://shop.oreilly.com/product/0636920023784.do) - [ ] [Webscraping with Python](https://www.packtpub.com/big-data-and-business-intelligence/web-scraping-python) From 208dd78ab0b12cd162a8f3770577103d2dcb5be1 Mon Sep 17 00:00:00 2001 From: Scott Davis Date: Mon, 23 May 2016 15:46:04 -0500 Subject: [PATCH 16/26] Completed Developing Data products class --- transcripts/scott-davis-transcript.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/transcripts/scott-davis-transcript.md b/transcripts/scott-davis-transcript.md index d35e270c..dd961653 100644 --- a/transcripts/scott-davis-transcript.md +++ 
b/transcripts/scott-davis-transcript.md @@ -25,7 +25,7 @@ Data Science Introductions - [X] [Statistical Inference](https://www.coursera.org/account/accomplishments/records/9733QCP94GEF) - [X] [Regression Models](https://www.coursera.org/account/accomplishments/records/PP8SKS7CPSDC) - [X] [Practical Machine Learning](https://www.coursera.org/account/accomplishments/certificate/AJJS85KTU6GZ) - - [ ] [Developing Data Products] in progress + - [X] [Developing Data Products](https://www.coursera.org/account/accomplishments/certificate/6QREL457PPKE) - [ ] [Data Science Capstone] From e47f6998952591e75f536c3ce47a13b43244b10a Mon Sep 17 00:00:00 2001 From: Scott Davis Date: Mon, 23 May 2016 15:53:40 -0500 Subject: [PATCH 17/26] updated with edx materials --- transcripts/scott-davis-transcript.md | 5 +++-- 1 file changed, 3 insertions(+), 2 deletions(-) diff --git a/transcripts/scott-davis-transcript.md b/transcripts/scott-davis-transcript.md index dd961653..62733735 100644 --- a/transcripts/scott-davis-transcript.md +++ b/transcripts/scott-davis-transcript.md @@ -86,7 +86,7 @@ PostGIS/PostGRESQL:

Distributed Computing

- [ ] Introduction to Spark, edx - - [ ] Machine Learning with Spark, edx + - [ ] Distributed Machine Learning with Apache Spark, edx

Data Mining

- [ ] Mining Massive Data Sets, by Stanford and Coursera @@ -116,7 +116,8 @@ PostGIS/PostGRESQL: - [ ] [TRANUS Model](http://www.tranus.com/tranus-english) - [ ] [Urban Sim](https://pypi.python.org/pypi/urbansim) - [ ] [Huff-tools Package in R](http://rstudio-pubs-static.s3.amazonaws.com/42357_1e6fcc5bcfec439096eb86a106ebf22e.html) - - + - [ ] [Big Data for Smart Cities](https://courses.edx.org/courses/course-v1:IEEEx+IntroData.x+2016_T3/info) +

Data Design/Data Viz

- [ ] [Beautiful Evidence](http://www.edwardtufte.com/tufte/books_be) - [ ] [Semiology of Graphics](http://www.amazon.com/Semiology-Graphics-Diagrams-Networks-Maps/dp/1589482611) From 791674ea3e83fb5d07a384f7795a23f7223c38cd Mon Sep 17 00:00:00 2001 From: Scott Davis Date: Mon, 23 May 2016 19:47:48 -0500 Subject: [PATCH 18/26] Finished webscraping with python --- transcripts/scott-davis-transcript.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/transcripts/scott-davis-transcript.md b/transcripts/scott-davis-transcript.md index 62733735..20c2d405 100644 --- a/transcripts/scott-davis-transcript.md +++ b/transcripts/scott-davis-transcript.md @@ -53,7 +53,7 @@ Python: - [X] [Google's Python Class](code.google.com/edu/languages/google-python-class/) - [X] [Introduction to Python for Data Science - edx](https://courses.edx.org/courses/course-v1:Microsoft+DAT208x+2T2016/info) - [ ] [Python for Data Analysis](http://shop.oreilly.com/product/0636920023784.do) - - [ ] [Webscraping with Python](https://www.packtpub.com/big-data-and-business-intelligence/web-scraping-python) + - [X] [Webscraping with Python](https://www.packtpub.com/big-data-and-business-intelligence/web-scraping-python) QGIS: - [X] [QGIS Tutorials and Tips](http://www.qgistutorials.com/en/) From 5cf800e872bcaccecda901b03cbf9da2e82f4dae Mon Sep 17 00:00:00 2001 From: Scott Davis Date: Tue, 24 May 2016 14:09:15 -0500 Subject: [PATCH 19/26] finished data science from scratch --- transcripts/scott-davis-transcript.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/transcripts/scott-davis-transcript.md b/transcripts/scott-davis-transcript.md index 20c2d405..9ce60026 100644 --- a/transcripts/scott-davis-transcript.md +++ b/transcripts/scott-davis-transcript.md @@ -13,7 +13,7 @@ Want to collaborate? Get in touch:

Base Introduction

Data Science Introductions - [ ] [Data Science with Open Source Tools](http://shop.oreilly.com/product/9780596802363.do) -- [ ] [Data Science from Scratch](http://shop.oreilly.com/product/0636920033400.do) +- [X] [Data Science from Scratch](http://shop.oreilly.com/product/0636920033400.do) - [X] [50 Years of Data Science](http://pages.cs.wisc.edu/~anhai/courses/784-fall15/50YearsDataScience.pdf) - [X] [Datasmart](http://www.amazon.com/Data-Smart-Science-Transform-Information/dp/111866146X/ref=sr_1_1?s=books&ie=UTF8&qid=1458768727&sr=1-1&keywords=datasmart) - This book is a thorough review of using Excel for data science tools. Every aspiring data scientist should work through this book because (1) you'll learn a lot because Excel makes you do every step and (2) you'll realize you need to learn R or python or some other way to do these analyses. - [ ] Data Science Specialization by Johns Hopkins / Coursera From bd079aa12611bbaa8936a764bc9ae3b1b7fcf4c2 Mon Sep 17 00:00:00 2001 From: Scott Davis Date: Mon, 13 Jun 2016 14:08:45 -0500 Subject: [PATCH 20/26] Updates for completions Completions for data science specialization Some Python completions Deleted some algorithm classes and added more geospatial --- transcripts/scott-davis-transcript.md | 24 ++++++++++++------------ 1 file changed, 12 insertions(+), 12 deletions(-) diff --git a/transcripts/scott-davis-transcript.md b/transcripts/scott-davis-transcript.md index 9ce60026..ea4b82b5 100644 --- a/transcripts/scott-davis-transcript.md +++ b/transcripts/scott-davis-transcript.md @@ -16,7 +16,7 @@ Data Science Introductions - [X] [Data Science from Scratch](http://shop.oreilly.com/product/0636920033400.do) - [X] [50 Years of Data Science](http://pages.cs.wisc.edu/~anhai/courses/784-fall15/50YearsDataScience.pdf) - [X] [Datasmart](http://www.amazon.com/Data-Smart-Science-Transform-Information/dp/111866146X/ref=sr_1_1?s=books&ie=UTF8&qid=1458768727&sr=1-1&keywords=datasmart) - This book is a thorough review of using Excel 
for data science tools. Every aspiring data scientist should work through this book because (1) you'll learn a lot because Excel makes you do every step and (2) you'll realize you need to learn R or python or some other way to do these analyses. - - [ ] Data Science Specialization by Johns Hopkins / Coursera + - [X] Data Science Specialization by Johns Hopkins / Coursera - [X] [Data Scientists Toolbox](https://www.coursera.org/account/accomplishments/certificate/UY4EBM46HL) - [X] [R Programming](https://www.coursera.org/account/accomplishments/records/Va5vuEvGKyr7UyHEL) - [X] [Getting and Cleaning Data](https://www.coursera.org/account/accomplishments/records/ENSGmvNfx24sANRW) @@ -26,13 +26,13 @@ Data Science Introductions - [X] [Regression Models](https://www.coursera.org/account/accomplishments/records/PP8SKS7CPSDC) - [X] [Practical Machine Learning](https://www.coursera.org/account/accomplishments/certificate/AJJS85KTU6GZ) - [X] [Developing Data Products](https://www.coursera.org/account/accomplishments/certificate/6QREL457PPKE) - - [ ] [Data Science Capstone] + - [X] [Data Science Capstone]

Mathematics/Statistics

- [ ] [Statistics for Spatial Data, Revised Edition](http://www.wiley.com/WileyCDA/WileyTitle/productCd-1119114616.html) - [ ] [Statistics for Spatio-Temporal Data](http://www.wiley.com/WileyCDA/WileyTitle/productCd-EHEP002348.html) - - [ ] [Linear Programming: An Introduction With Applications (Second Edition)](http://www.amazon.com/Linear-Programming-Introduction-Applications-Edition/dp/1463543670?ie=UTF8&psc=1&redirect=true&ref_=oh_aui_detailpage_o01_s00) + - [X] [Linear Programming: An Introduction With Applications (Second Edition)](http://www.amazon.com/Linear-Programming-Introduction-Applications-Edition/dp/1463543670?ie=UTF8&psc=1&redirect=true&ref_=oh_aui_detailpage_o01_s00) - [X] Problem-Solving Heuristics: [How to Solve It](http://www.amazon.com/How-Solve-It-Mathematical-Princeton/dp/069111966X)

Computing

@@ -49,7 +49,7 @@ R Libraries/Task Views * Finance [CRAN Task View: Empirical Finance](https://cran.r-project.org/web/views/Finance.html) Python: - - [ ] [Dive Into Python](http://www.diveintopython.net/) + - [X] [Dive Into Python](http://www.diveintopython.net/) - [X] [Google's Python Class](code.google.com/edu/languages/google-python-class/) - [X] [Introduction to Python for Data Science - edx](https://courses.edx.org/courses/course-v1:Microsoft+DAT208x+2T2016/info) - [ ] [Python for Data Analysis](http://shop.oreilly.com/product/0636920023784.do) @@ -64,7 +64,7 @@ QGIS: - [ ] [QGIS Map Design](https://locatepress.com/qmd) I've just thumbed through this, but it's beautiful and belongs on any list of GIS books. MySQL: - - [ ] [Learn MySQL in One Video](https://www.youtube.com/watch?v=yPu6qV5byu4) + - [X] [Learn MySQL in One Video](https://www.youtube.com/watch?v=yPu6qV5byu4) - [ ] [MySQL Workbench Starter](code.google.com/edu/languages/google-python-class/) Octave: @@ -78,15 +78,12 @@ PostGIS/PostGRESQL:

Algorithms

- [ ] Data Structures and Algorithms by UCSD / Coursera - [ ] [Algorithmic Toolbox] in progress - - [ ] [Data Structures] - - [ ] [Algorithms on Graphs and Trees] - - [ ] [Algorithms on Strings] - - [ ] [Advanced Algorithms and Complexity] - - [ ] [Assembling Genomes and Finding Disease-Causing Mutations] +

Distributed Computing

- [ ] Introduction to Spark, edx - [ ] Distributed Machine Learning with Apache Spark, edx + - [ ] [Big Data for Smart Cities](https://courses.edx.org/courses/course-v1:IEEEx+IntroData.x+2016_T3/info)

Data Mining

- [ ] Mining Massive Data Sets, by Stanford and Coursera @@ -104,11 +101,14 @@ PostGIS/PostGRESQL:

Analysis

- [ ] [Practical Data Science Cookbook](https://www.packtpub.com/big-data-and-business-intelligence/practical-data-science-cookbook) - [ ] [R Data Analysis Cookbook](http://www.amazon.com/Data-Analysis-Cookbook-Recipes-Deliver/dp/1783989068) + - [ ] [Python Data Science Essentials](https://www.packtpub.com/big-data-and-business-intelligence/python-data-science-essentials)

Spatial Analysis

- [ ] [An Introduction to R for Spatial Analysis and Mapping](https://uk.sagepub.com/en-gb/eur/an-introduction-to-r-for-spatial-analysis-and-mapping/book241031) - [ ] [Applied Spatial Data Analysis with R](http://www.springer.com/us/book/9781461476177) - [ ] [Geospatial Analysis - 5th Edition, 2015 - de Smith, Goodchild, Longley](http://www.spatialanalysisonline.com/HTML/index.html) + - [ ] [Learning Geospatial Analysis with Python](https://www.packtpub.com/application-development/learning-geospatial-analysis-python) + - [ ] [Python Geospatial Development - Second Edition](https://www.packtpub.com/application-development/python-geospatial-development-second-edition)

Land Use/Transport/Gravity Modeling

- [ ] [Integrated Land Use and Transport Modelling: Decision Chains and Hierarchies](http://www.amazon.com/gp/product/0521022177?psc=1&redirect=true&ref_=oh_aui_detailpage_o03_s00) @@ -116,7 +116,7 @@ PostGIS/PostGRESQL: - [ ] [TRANUS Model](http://www.tranus.com/tranus-english) - [ ] [Urban Sim](https://pypi.python.org/pypi/urbansim) - [ ] [Huff-tools Package in R](http://rstudio-pubs-static.s3.amazonaws.com/42357_1e6fcc5bcfec439096eb86a106ebf22e.html) - - [ ] [Big Data for Smart Cities](https://courses.edx.org/courses/course-v1:IEEEx+IntroData.x+2016_T3/info) +

Data Design/Data Viz

- [ ] [Beautiful Evidence](http://www.edwardtufte.com/tufte/books_be) @@ -128,7 +128,7 @@ PostGIS/PostGRESQL: - [X] [Storytelling with Data](http://www.amazon.com/gp/product/1119002257?psc=1&redirect=true&ref_=oh_aui_detailpage_o09_s00) - [ ] [Mastering Python Data Visualization](https://www.packtpub.com/big-data-and-business-intelligence/mastering-python-data-visualization) - [ ] [The Grammar of Graphics](http://www.springer.com/us/book/9780387245447) - - [ ] [R Graphics Cookbook](http://shop.oreilly.com/product/9780596809164.do) + - [X] [R Graphics Cookbook](http://shop.oreilly.com/product/9780596809164.do)

Relevant prior studies

- [X] MS in Community and Regional Planning, UT-Austin From 5c4ef33ec086d907cfc15d6a4218ae4a9a185832 Mon Sep 17 00:00:00 2001 From: Scott Davis Date: Sun, 17 Jul 2016 15:42:08 -0500 Subject: [PATCH 21/26] Added completion of coursera data science --- transcripts/scott-davis-transcript.md | 4 ++-- 1 file changed, 2 insertions(+), 2 deletions(-) diff --git a/transcripts/scott-davis-transcript.md b/transcripts/scott-davis-transcript.md index ea4b82b5..71d51a2d 100644 --- a/transcripts/scott-davis-transcript.md +++ b/transcripts/scott-davis-transcript.md @@ -16,7 +16,7 @@ Data Science Introductions - [X] [Data Science from Scratch](http://shop.oreilly.com/product/0636920033400.do) - [X] [50 Years of Data Science](http://pages.cs.wisc.edu/~anhai/courses/784-fall15/50YearsDataScience.pdf) - [X] [Datasmart](http://www.amazon.com/Data-Smart-Science-Transform-Information/dp/111866146X/ref=sr_1_1?s=books&ie=UTF8&qid=1458768727&sr=1-1&keywords=datasmart) - This book is a thorough review of using Excel for data science tools. Every aspiring data scientist should work through this book because (1) you'll learn a lot because Excel makes you do every step and (2) you'll realize you need to learn R or python or some other way to do these analyses. 
- - [X] Data Science Specialization by Johns Hopkins / Coursera + - [X] [Data Science Specialization by Johns Hopkins / Coursera](https://www.coursera.org/account/accomplishments/specialization/3WN77YYQ7QK7) - [X] [Data Scientists Toolbox](https://www.coursera.org/account/accomplishments/certificate/UY4EBM46HL) - [X] [R Programming](https://www.coursera.org/account/accomplishments/records/Va5vuEvGKyr7UyHEL) - [X] [Getting and Cleaning Data](https://www.coursera.org/account/accomplishments/records/ENSGmvNfx24sANRW) @@ -26,7 +26,7 @@ Data Science Introductions - [X] [Regression Models](https://www.coursera.org/account/accomplishments/records/PP8SKS7CPSDC) - [X] [Practical Machine Learning](https://www.coursera.org/account/accomplishments/certificate/AJJS85KTU6GZ) - [X] [Developing Data Products](https://www.coursera.org/account/accomplishments/certificate/6QREL457PPKE) - - [X] [Data Science Capstone] + - [X] [Data Science Capstone](https://www.coursera.org/account/accomplishments/certificate/A9M48VWHBAMT)

Mathematics/Statistics

From ef353f9685115ca1c59adc8e98374222c5c91415 Mon Sep 17 00:00:00 2001 From: Scott Davis Date: Sat, 20 Aug 2016 14:29:33 -0500 Subject: [PATCH 22/26] updated with algorithmic toolbox completion --- transcripts/scott-davis-transcript.md | 8 ++++---- 1 file changed, 4 insertions(+), 4 deletions(-) diff --git a/transcripts/scott-davis-transcript.md b/transcripts/scott-davis-transcript.md index 71d51a2d..f8625e2f 100644 --- a/transcripts/scott-davis-transcript.md +++ b/transcripts/scott-davis-transcript.md @@ -76,8 +76,8 @@ PostGIS/PostGRESQL: - [ ] [PostgreSQL: Up and Running: A Practical Introduction to the Advanced Open Source Database](http://shop.oreilly.com/product/0636920032144.do)

Algorithms

-- [ ] Data Structures and Algorithms by UCSD / Coursera - - [ ] [Algorithmic Toolbox] in progress +- [ ] Data Structures and Algorithms by UCSD / Coursera [Decided not to take the balance of the specialization) + - [X] [Algorithmic Toolbox] in progress (https://www.coursera.org/account/accomplishments/certificate/RUKKXTCFDAPV)

Distributed Computing

@@ -100,8 +100,8 @@ PostGIS/PostGRESQL:

Analysis

- [ ] [Practical Data Science Cookbook](https://www.packtpub.com/big-data-and-business-intelligence/practical-data-science-cookbook) - - [ ] [R Data Analysis Cookbook](http://www.amazon.com/Data-Analysis-Cookbook-Recipes-Deliver/dp/1783989068) - - [ ] [Python Data Science Essentials](https://www.packtpub.com/big-data-and-business-intelligence/python-data-science-essentials) + - [X] [R Data Analysis Cookbook](http://www.amazon.com/Data-Analysis-Cookbook-Recipes-Deliver/dp/1783989068) + - [X] [Python Data Science Essentials](https://www.packtpub.com/big-data-and-business-intelligence/python-data-science-essentials)

Spatial Analysis

- [ ] [An Introduction to R for Spatial Analysis and Mapping](https://uk.sagepub.com/en-gb/eur/an-introduction-to-r-for-spatial-analysis-and-mapping/book241031) From 8397e0ebd33a581cc8ccf91ca57ad68a602c9067 Mon Sep 17 00:00:00 2001 From: Scott Davis Date: Tue, 30 Aug 2016 22:38:08 -0500 Subject: [PATCH 23/26] updated with some additional books finished. --- transcripts/scott-davis-transcript.md | 6 +++--- 1 file changed, 3 insertions(+), 3 deletions(-) diff --git a/transcripts/scott-davis-transcript.md b/transcripts/scott-davis-transcript.md index f8625e2f..5a634562 100644 --- a/transcripts/scott-davis-transcript.md +++ b/transcripts/scott-davis-transcript.md @@ -52,7 +52,7 @@ Python: - [X] [Dive Into Python](http://www.diveintopython.net/) - [X] [Google's Python Class](code.google.com/edu/languages/google-python-class/) - [X] [Introduction to Python for Data Science - edx](https://courses.edx.org/courses/course-v1:Microsoft+DAT208x+2T2016/info) - - [ ] [Python for Data Analysis](http://shop.oreilly.com/product/0636920023784.do) + - [X] [Python for Data Analysis](http://shop.oreilly.com/product/0636920023784.do) - [X] [Webscraping with Python](https://www.packtpub.com/big-data-and-business-intelligence/web-scraping-python) QGIS: @@ -107,8 +107,8 @@ PostGIS/PostGRESQL: - [ ] [An Introduction to R for Spatial Analysis and Mapping](https://uk.sagepub.com/en-gb/eur/an-introduction-to-r-for-spatial-analysis-and-mapping/book241031) - [ ] [Applied Spatial Data Analysis with R](http://www.springer.com/us/book/9781461476177) - [ ] [Geospatial Analysis - 5th Edition, 2015 - de Smith, Goodchild, Longley](http://www.spatialanalysisonline.com/HTML/index.html) - - [ ] [Learning Geospatial Analysis with Python](https://www.packtpub.com/application-development/learning-geospatial-analysis-python) - - [ ] [Python Geospatial Development - Second Edition](https://www.packtpub.com/application-development/python-geospatial-development-second-edition) + - [X] [Learning Geospatial 
Analysis with Python](https://www.packtpub.com/application-development/learning-geospatial-analysis-python) + - [X] [Python Geospatial Development - Second Edition](https://www.packtpub.com/application-development/python-geospatial-development-second-edition)

Land Use/Transport/Gravity Modeling

- [ ] [Integrated Land Use and Transport Modelling: Decision Chains and Hierarchies](http://www.amazon.com/gp/product/0521022177?psc=1&redirect=true&ref_=oh_aui_detailpage_o03_s00) From 732cc156e4a11602e0889275d723d0f9ebaabd27 Mon Sep 17 00:00:00 2001 From: Scott Davis Date: Thu, 8 Sep 2016 08:51:01 -0500 Subject: [PATCH 24/26] updated with book completions --- transcripts/scott-davis-transcript.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/transcripts/scott-davis-transcript.md b/transcripts/scott-davis-transcript.md index 5a634562..ad8e379b 100644 --- a/transcripts/scott-davis-transcript.md +++ b/transcripts/scott-davis-transcript.md @@ -60,7 +60,7 @@ QGIS: - [X] [Mastering QGIS](https://www.packtpub.com/application-development/mastering-qgis) - [ ] [Building Mapping Applications with QGIS](https://www.packtpub.com/application-development/building-mapping-applications-qgis) - [X] [GIS Tutorial Workbook 1](https://esripress.esri.com/display/index.cfm?fuseaction=display&websiteID=232&moduleID=1) This is for ArcView, but you can work the examples in QGIS too - - [ ] [GIS Tutorial Workbook 2: Spatial Analysis](https://esripress.esri.com/display/index.cfm?fuseaction=display&websiteID=230&moduleID=0) This is for ArcView, but you can work the examples in QGIS too + - [X] [GIS Tutorial Workbook 2: Spatial Analysis](https://esripress.esri.com/display/index.cfm?fuseaction=display&websiteID=230&moduleID=0) This is for ArcView, but you can work the examples in QGIS too - [ ] [QGIS Map Design](https://locatepress.com/qmd) I've just thumbed through this, but it's beautiful and belongs on any list of GIS books. 
MySQL: From 29b5eefd07af1cb8db13299e38fabd45d92e88e4 Mon Sep 17 00:00:00 2001 From: Scott Davis Date: Thu, 15 Sep 2016 14:49:25 -0500 Subject: [PATCH 25/26] updated with book completions --- transcripts/scott-davis-transcript.md | 14 ++++---------- 1 file changed, 4 insertions(+), 10 deletions(-) diff --git a/transcripts/scott-davis-transcript.md b/transcripts/scott-davis-transcript.md index ad8e379b..b72b5dbe 100644 --- a/transcripts/scott-davis-transcript.md +++ b/transcripts/scott-davis-transcript.md @@ -12,7 +12,7 @@ Want to collaborate? Get in touch:

Open Source Curriculum

Base Introduction

Data Science Introductions -- [ ] [Data Science with Open Source Tools](http://shop.oreilly.com/product/9780596802363.do) +- [X] [Data Science with Open Source Tools](http://shop.oreilly.com/product/9780596802363.do) - [X] [Data Science from Scratch](http://shop.oreilly.com/product/0636920033400.do) - [X] [50 Years of Data Science](http://pages.cs.wisc.edu/~anhai/courses/784-fall15/50YearsDataScience.pdf) - [X] [Datasmart](http://www.amazon.com/Data-Smart-Science-Transform-Information/dp/111866146X/ref=sr_1_1?s=books&ie=UTF8&qid=1458768727&sr=1-1&keywords=datasmart) - This book is a thorough review of using Excel for data science tools. Every aspiring data scientist should work through this book because (1) you'll learn a lot because Excel makes you do every step and (2) you'll realize you need to learn R or python or some other way to do these analyses. @@ -39,7 +39,7 @@ Data Science Introductions R: - [ ] [R in Action](https://www.manning.com/books/r-in-action-second-edition?a_bid=5c2b1e1d&a_aid=RiA2ed) - [ ] [R Cookbook](http://shop.oreilly.com/product/9780596809164.do) - - [ ] [Forecasting: Principles and Practice](http://otexts.com/fpp/) + - [X] [Forecasting: Principles and Practice](http://otexts.com/fpp/) R Libraries/Task Views * [ProjectTemplate](http://projecttemplate.net/index.html) @@ -58,14 +58,14 @@ Python: QGIS: - [X] [QGIS Tutorials and Tips](http://www.qgistutorials.com/en/) - [X] [Mastering QGIS](https://www.packtpub.com/application-development/mastering-qgis) - - [ ] [Building Mapping Applications with QGIS](https://www.packtpub.com/application-development/building-mapping-applications-qgis) + - [X] [Building Mapping Applications with QGIS](https://www.packtpub.com/application-development/building-mapping-applications-qgis) - [X] [GIS Tutorial Workbook 1](https://esripress.esri.com/display/index.cfm?fuseaction=display&websiteID=232&moduleID=1) This is for ArcView, but you can work the examples in QGIS too - [X] [GIS Tutorial Workbook 2: Spatial 
Analysis](https://esripress.esri.com/display/index.cfm?fuseaction=display&websiteID=230&moduleID=0) This is for ArcView, but you can work the examples in QGIS too - [ ] [QGIS Map Design](https://locatepress.com/qmd) I've just thumbed through this, but it's beautiful and belongs on any list of GIS books. MySQL: - [X] [Learn MySQL in One Video](https://www.youtube.com/watch?v=yPu6qV5byu4) - - [ ] [MySQL Workbench Starter](code.google.com/edu/languages/google-python-class/) + - [X] [MySQL Workbench Starter](code.google.com/edu/languages/google-python-class/) Octave: - [ ] [GNU Octave Beginners Guide](https://www.packtpub.com/big-data-and-business-intelligence/gnu-octave-beginners-guide) @@ -79,12 +79,6 @@ PostGIS/PostGRESQL: - [ ] Data Structures and Algorithms by UCSD / Coursera [Decided not to take the balance of the specialization) - [X] [Algorithmic Toolbox] in progress (https://www.coursera.org/account/accomplishments/certificate/RUKKXTCFDAPV) - -

Distributed Computing

- - [ ] Introduction to Spark, edx - - [ ] Distributed Machine Learning with Apache Spark, edx - - [ ] [Big Data for Smart Cities](https://courses.edx.org/courses/course-v1:IEEEx+IntroData.x+2016_T3/info) -

Data Mining

- [ ] Mining Massive Data Sets, by Stanford and Coursera - [ ] [Clean Data] (https://www.packtpub.com/big-data-and-business-intelligence/clean-data) From f06e81c15c7ae33d2681f13d0e4cf46272c8e254 Mon Sep 17 00:00:00 2001 From: Scott Davis Date: Sat, 17 Sep 2016 08:44:47 -0500 Subject: [PATCH 26/26] updated with some additional books finished. --- transcripts/scott-davis-transcript.md | 7 +++++-- 1 file changed, 5 insertions(+), 2 deletions(-) diff --git a/transcripts/scott-davis-transcript.md b/transcripts/scott-davis-transcript.md index b72b5dbe..d6ba2428 100644 --- a/transcripts/scott-davis-transcript.md +++ b/transcripts/scott-davis-transcript.md @@ -49,6 +49,7 @@ R Libraries/Task Views * Finance [CRAN Task View: Empirical Finance](https://cran.r-project.org/web/views/Finance.html) Python: + - [X] [Jumpstart Python by Building 10 Apps](https://training.talkpython.fm/courses/details/python-language-jumpstart-building-10-apps) This is probably the best introduction to Python that I have seen. - [X] [Dive Into Python](http://www.diveintopython.net/) - [X] [Google's Python Class](code.google.com/edu/languages/google-python-class/) - [X] [Introduction to Python for Data Science - edx](https://courses.edx.org/courses/course-v1:Microsoft+DAT208x+2T2016/info) @@ -58,14 +59,17 @@ Python: QGIS: - [X] [QGIS Tutorials and Tips](http://www.qgistutorials.com/en/) - [X] [Mastering QGIS](https://www.packtpub.com/application-development/mastering-qgis) + - [X] [QGIS 2.0 Cookbook](https://www.packtpub.com/application-development/qgis-2-cookbook) Advanced data management, data visualization and spatial analysis techniques with QGIS. 
- [X] [Building Mapping Applications with QGIS](https://www.packtpub.com/application-development/building-mapping-applications-qgis) - [X] [GIS Tutorial Workbook 1](https://esripress.esri.com/display/index.cfm?fuseaction=display&websiteID=232&moduleID=1) This is for ArcView, but you can work the examples in QGIS too - [X] [GIS Tutorial Workbook 2: Spatial Analysis](https://esripress.esri.com/display/index.cfm?fuseaction=display&websiteID=230&moduleID=0) This is for ArcView, but you can work the examples in QGIS too + - [ ] QGIS Python Programming Cookbook (https://www.packtpub.com/application-development/qgis-python-programming-cookbook) Automated desktop QGIS processing. - [ ] [QGIS Map Design](https://locatepress.com/qmd) I've just thumbed through this, but it's beautiful and belongs on any list of GIS books. + MySQL: - [X] [Learn MySQL in One Video](https://www.youtube.com/watch?v=yPu6qV5byu4) - - [X] [MySQL Workbench Starter](code.google.com/edu/languages/google-python-class/) + - [X] [MySQL Explained](https://www.ostraining.com/books/mysql/about/) Octave: - [ ] [GNU Octave Beginners Guide](https://www.packtpub.com/big-data-and-business-intelligence/gnu-octave-beginners-guide) @@ -80,7 +84,6 @@ PostGIS/PostGRESQL: - [X] [Algorithmic Toolbox] in progress (https://www.coursera.org/account/accomplishments/certificate/RUKKXTCFDAPV)

Data Mining

- - [ ] Mining Massive Data Sets, by Stanford and Coursera - [ ] [Clean Data] (https://www.packtpub.com/big-data-and-business-intelligence/clean-data)

Machine Learning/Predictive Analytics - Foundational/Theoretical/Practical