https://www.coursera.org/learn/scala-spark-big-data
Several references for algorithm and Python development with Spark:
https://spark.apache.org/docs/1.6.0/ml-guide.html
https://spark.apache.org/docs/1.6.0/api/python/index.html
Distributed matrix multiplication and pseudo-inverse calculation in Spark (Stack Overflow):
https://stackoverflow.com/questions/29869567/spark-distributed-matrix-multiply-and-pseudo-inverse-calculating
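To make the idea behind that question concrete, here is a minimal sketch in plain Python (no Spark required) of how a pseudo-inverse can be assembled from row-wise map and reduce steps. It is not taken from the linked answer, which may use a different method. It assumes a tall matrix A with full column rank, so that the left pseudo-inverse is (A^T A)^-1 A^T, and it assumes the number of columns is small enough to invert A^T A on a single machine. All function names (`outer`, `add`, `gram`, `invert`, `pseudo_inverse`) are hypothetical helpers chosen for illustration.

```python
from functools import reduce


def outer(row):
    """Map step: the n x n outer product of one matrix row with itself."""
    return [[a * b for b in row] for a in row]


def add(m1, m2):
    """Reduce step: element-wise sum of two equally sized matrices."""
    return [[x + y for x, y in zip(r1, r2)] for r1, r2 in zip(m1, m2)]


def gram(rows):
    """Compute A^T A as the sum of the per-row outer products."""
    return reduce(add, map(outer, rows))


def invert(m):
    """Gauss-Jordan inverse (with partial pivoting) of a small square matrix."""
    n = len(m)
    # Augment m with the identity matrix: [m | I].
    aug = [list(map(float, row)) + [float(i == j) for j in range(n)]
           for i, row in enumerate(m)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        if abs(aug[pivot][col]) < 1e-12:
            raise ValueError("matrix is singular; A lacks full column rank")
        aug[col], aug[pivot] = aug[pivot], aug[col]
        p = aug[col][col]
        aug[col] = [v / p for v in aug[col]]
        for r in range(n):
            if r != col:
                f = aug[r][col]
                aug[r] = [v - f * w for v, w in zip(aug[r], aug[col])]
    # The right half of the augmented matrix now holds the inverse.
    return [row[n:] for row in aug]


def pseudo_inverse(rows):
    """Left pseudo-inverse (A^T A)^-1 A^T, returned as an n x m matrix."""
    g_inv = invert(gram(rows))
    # Column i of the result is g_inv times row i of A, so this is a per-row map.
    cols = [[sum(g * a for g, a in zip(g_row, row)) for g_row in g_inv]
            for row in rows]
    return [list(c) for c in zip(*cols)]  # transpose columns into rows


if __name__ == "__main__":
    A = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
    for line in pseudo_inverse(A):
        print(line)
```

In a Spark job the `map` and `reduce` calls in `gram` would presumably run over an RDD of rows instead of a local list, while the small n x n inversion stays on the driver. For rank-deficient or ill-conditioned matrices an SVD-based approach is the safer choice.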
Python development:
https://spark.apache.org/docs/latest/programming-guide.html
Official website:
https://spark.apache.org/
Programming guide in all supported languages (Scala, Java, Python, R):
https://spark.apache.org/docs/latest/programming-guide.html
Modules built on Spark:
• Spark Streaming: processing real-time data streams
• Spark SQL, Datasets, and DataFrames: support for structured data and relational queries
• MLlib: built-in machine learning library
• GraphX: Spark’s API for graphs and graph-parallel computation
Free online course: “Big Data Analysis with Scala and Spark”
https://www.coursera.org/learn/scala-spark-big-data/
To submit your results for the course, start sbt and run the submit command:
sbt
submit <email> <token>