Big Data Analysis and Machine Learning using PySpark
Optimize Large-Scale Data Processing and Machine Learning with PySpark.
This workshop is designed for individuals eager to dive into the world of big data and machine learning using PySpark. It covers the fundamentals of Python and PySpark, providing participants with the tools to manipulate and analyze large-scale data efficiently. By focusing on key PySpark concepts, such as data abstraction (RDD, DataFrame, Dataset) and lazy evaluation, participants will gain the skills necessary to handle, preprocess, and analyze big data effectively.
Throughout the course, participants will engage in hands-on learning with a rich, interactive experience. Our Instructor and two Teaching Assistants will guide participants through the material, offering support and troubleshooting assistance whenever needed.
Upon completion of this workshop, you will be able to:
Introduction to Python and PySpark
Subsetting and Aggregating Data in PySpark
Machine Learning with PySpark
This testimonial video was recorded after our previous Online Data Science Series workshop, Time Series Analysis for Business Forecasting.
Our learning format is online and interactive, so you will experience the class as if you were present in a physical classroom. You can join the class with your Zoom account on the scheduled dates.
Workshops in this series are tailored to casual programmers and non-programmers who are taking their first steps into data science. They assume no prior knowledge or academic background, and attendees will be introduced to the beautiful art of writing R / Python code to produce data visualizations and build machine learning models. Each workshop has a gentle learning slope designed with non-technical professionals and academics in mind.
If I don’t have any IT or programming skills, can I still attend this workshop?
Yes, you can still attend the workshop as it is a beginner-friendly workshop.
How do I join the online-interactive learning class after completing payment and registration?
Our system will send you an email containing a link and details to join a Google Classroom.
What platform will be utilized for this online-interactive learning workshop?
Online learning will be conducted via Zoom.us. The link to join the Zoom class will be announced via Google Classroom.
How will the participants receive the learning materials?
Learning materials can be obtained via Google Classroom.
Would I receive a certificate after participating in the Workshop?
Yes, you will receive a certificate of completion.
Sr. Data Science Instructor at Algoritma Data Science School
Dyah Nurlita is an experienced Sr. Data Science Instructor at Algoritma Data Science School, specializing in comprehensive data science training for corporate clients. Having conducted training sessions for organizations such as Jasa Raharja, Pertamina Hulu Mahakam, Perusahaan Listrik Negara (PLN), and PT. Bank Central Asia (BCA), Lita has honed her expertise in several essential areas of data science: Python for Data Analysis, Exploratory Data Analysis, Data Wrangling and Visualization, SQL for Data Manipulation, and Programming for Data Science.