Error occured ! We are notified and will try and resolve this as soon as possible.
WARNING! [2] file_put_contents(/home/gelembjuk/domains/myeducationpath.com/app/../html/cache/memory/course_22750_0_e086762d743c0218beb85ea6e1b456cae.txt): failed to open stream: No such file or directory . Line 75 in file /home/gelembjuk/domains/myeducationpath.com/html/include/class.cache.php. Continue execution. 1712625; index.php; 216.73.216.221; GET; url=courses/22750/distributed-machine-learning-with-apache-spark.htm&; ; Mozilla/5.0 AppleWebKit/537.36 (KHTML, like Gecko; compatible; ClaudeBot/1.0; +claudebot@anthropic.com); ; Executon time: 0 MyEducationPath.com :: edX : Distributed Machine Learning with Apache Spark

Distributed Machine Learning with Apache Spark

0 votes
Free Closed [?]
Distributed Machine Learning with Apache Spark

Machine learning aims to extract knowledge from data, relying on fundamental concepts in computer science, statistics, probability and optimization. Learning algorithms enable a wide range of applications, from everyday tasks such as product recommendations and spam filtering to bleeding edge applications like self-driving cars and personalized medicine. In the age of ‘big data’, with datasets rapidly growing in size and complexity and cloud computing becoming more pervasive, machine learning techniques are fast becoming a core component of large-scale data processing pipelines.



This statistics and data analysis course introduces the underlying statistical and algorithmic principles required to develop scalable real-world machine learning pipelines. We present an integrated view of data processing by highlighting the various components of these pipelines, including exploratory data analysis, feature extraction, supervised learning, and model evaluation. You will gain hands-on experience applying these principles using Spark, a cluster computing system well-suited for large-scale machine learning tasks, and its packages spark.ml and spark.mllib. You will implement distributed algorithms for fundamental statistical models (linear regression, logistic regression, principal component analysis) while tackling key problems from domains such as online advertising and cognitive neuroscience.


Categories:
Starts : 2016-07-11

Comments

Alternatives

-- no alternatives found for the course --
If you know any alternatives, please let us know.

Prerequisites

-- no prerequsites found for the course --
If you can suggest any prerequisite, please let us know.

Paths

No Paths inclusing the course. You can build and share a path with this course included.

Certification Exams

-- there are no exams to get certification after this course --
If your company does certification for those who completed this course then register your company as certification vendor and add your exams to the Exams Directory.

Similar courses

Courses related to the course subject

《Java程序设计》课程是使用Java语言进行应用程序设计的课程,针对各专业的大学本科生开设。课…

4G is the cutting-edge network technology that links millions of smartphones to the internet. But how does it actually work? Ideal…

SQL is the language of all databases, this course offers an easy way to master the SQL fundamentals…

Have you ever wondered how information is transmitted using your mobile phone or a WiFi hotspot? This introductory course seeks to…

Have you ever wondered how information is transmitted using your mobile phone or a WiFi hotspot? This introductory course seeks to…

Have you ever wondered how information is transmitted using your mobile phone or a WiFi hotspot? Gain an understanding of the basi…

DevOps is the union of people, process and products to enable the continuous delivery of value to end users. It is not only about…

En este curso trabajarás la definición, el desarrollo y el cierre de un proyecto de Tecnologías de la Informaci&oac…

This is a graduate course on the design and analysis of algorithms, covering several advanced topics not studied in typical introd…

Gain a deeper understanding of Spark by learning about its APIs, architecture, and common use cases. This statistics and data ana…

Let us know when you did the course Distributed Machine Learning with Apache Spark.

Started on: Completed on:
Your grade (if any):
Comments:

Add the course Distributed Machine Learning with Apache Spark to My Personal Education Path.

Start the course on:
Duration of study:
Notes:

Successfully added to your path.

View your path

Select what exam to connect to the course. The course will be displayed on the exam page in the list of courses supported for certification with the exam.


Notes about how the exam certifies students of the course (optional):