fbpx

Real-Time Data Processing with Kafka and Python

A Beginner’s Guide to Streaming Data

  • Schedule

    3 – 6 March 2025

    18.30 – 21.30 (WIB)

  • Online-Interactive Learning

    Via Zoom

  • Investment

    Rp. 1.500.000

Course Summary

In the modern world, real-time data is essential for businesses to stay competitive. From detecting fraudulent transactions to providing personalized recommendations, the ability to process and analyze data as it arrives is becoming more and more critical. However, traditional data processing methods often struggle to keep up with the fast-paced nature of real-time data.

Apache Kafka is a tool that enables businesses to process large amounts of data in real-time by creating data pipelines that can send and receive data continuously. Coupled with Python, a beginner-friendly programming language, Kafka can be used to build powerful applications that handle streaming data.

This workshop will introduce absolute beginners to the basics of real-time data processing with Kafka and Python, without requiring prior experience in programming or complex tools.

Learning Outcomes

Upon completion of this workshop, participants will be able to:

  • Understand the concept of real-time data processing and why it is important.
  • Learn the basics of Apache Kafka and its role in data streaming.
  • Set up Kafka and Python on their computer to create data pipelines.
  • Write simple Python programs to send and receive data using Kafka.
  • Build a simple real-time data processing application.

Syllabus

  • Overview of streaming data and real-time applications
  • Introduction to Apache Kafka
  • Introduction to Python
  • Installing Apache Kafka
  • Setting up Python and Jupyter Notebook
  • Installing necessary Python libraries
  • Introduction to the Kafka-Python library.
  • Creating Kafka producers to send data
  • Creating Kafka consumers to retrieve and process data
  • Connecting Kafka to real-time data sources (e.g., log files, simulated sensor data).
  • Processing streaming data using Python.
  • Data Transformation and Analytics
    • Filtering, transforming, and aggregating data from Kafka topics.
    • Visualizing real-time data using Python libraries (e.g., Pandas).
  • Designing a real-time data pipeline for a business use case.
  • Building a streaming application for real-time data processing.

STUDENT TESTIMONIALS

This testimonial video is taken after our previous Online Data Science Series: Time Series Analysis for Business Forecasting.

LEARN FROM ANYWHERE

Our learning format is online-interactive, you will feel the interactive experience as if you were present in a physical classroom. You can access the class using your Zoom account on pre-defined dates.

  • LEARN AT YOUR OWN PACE

    Zoom recording, course Books (PDF & HTML files), the dataset for practice, reference notes, and working files are accessible through our Learning Management System account.

  • PROOF YOUR MASTERY

    Show current and prospective employers of your mastery in computer vision with a signed certificate of completion.

  • CONNECT WITH LIKE MINDED PEOPLE

    Be a part of our data-passionate community with 5000+ members and 1000+ alumni.

FOR ABSOLUTE BEGINNERS

Workshops in this series are tailored to casual programmers and non-programmers that are taking their first steps into data science. It assumes no prior knowledge or academic background, and attendees will be introduced to the beautiful art of writing R / Python code to produce data visualization and build machine learning models. The workshop has a gentle learning slope that is designed with non-technical professionals and academics in mind.

Yes, you can still attend the workshop as it is a beginner-friendly workshop.

Our system will send you an email containing a link and details to join a Google Classroom.

Online learning will be conducted via Zoom.us, Link to join the Zoom Class will be announced via Google Classroom.

Learning materials can be obtain via Google Classroom

Yes, you will receive a certificate of completion.

YOUR INSTRUCTOR

Dwi Gustin Nurdialit

Sr. Data Science Instructor at Algoritma Data Science School

Dwi Gustin Nurdialit is a Senior Data Science Instructor with over 3 years of experience using programming languages like R, Python, and SQL. She has excelled as a lead instructor in numerous corporate training sessions spanning diverse industries, including finance, retail, logistics, mining, and telecommunications. Dwi also plays a key role as a course producer, creating materials for corporate training programs, focusing on data visualization, machine learning, and Large Language Models (LLM). Her passion lies in making data science accessible and practical for everyone.