fbpx

Tactical Data Science

for

Beginning Practitioners

A more systemic, structural approach to data science tasks

Tactical Data Science for Beginning Practitioners

  • Wednesday, 27 November 2019

    18.00 – 21.30

  • IDR 100,000

    Workshop will be delivered in English

Course Summary

An alarming amount of data science resources online and offline focus on the syntax and semantics of R and Python programming languages.

While getting to grips with the language is highly important for any aspiring data scientists, understanding the nuances behind the work can offer a huge boost in productivity and clarity.

This workshop will instill a helpful mental model that can help you be more productive throughout your analysis, regardless of your programming language of choice. Participants are encouraged to download the free accompanying course book if they wish to practice on the topic.

Tactical data science for beginning practitioners zooms in on the mental models behind common data preparation tasks, and aim to equip beginning practitioners with some practical advice on wrangling data in a productive manner.

The workshop’s materials, assets and datasets are publicly available on GitHub. The workshop emphasizes practical techniques and problem-solving mentalities that:

  • Favor a problem-solving style that is efficient and yields a short feedback cycle
  • Isolate contexts by diagnosing your data before exploring it
  • Diagnostic questions are sanity checks. They asks: “does the state of data conform to my expectations and statistical reality?”
  • Exploratory questions are concerned with pattern discovery. They asks: “how can the information in the data be applied?”
  • Incorporate prior knowledge and domain knowledge in your data preparation tasks
  • Preserve an unmodified copy of the data and understand that every language behave differently and yield different results even if they look semantically similar

The instructor will work through the materials in R and Python code, so a beginner-level familiarity in either language is assumed but not required.

Instructor

Samuel Chan

Machine learning practitioner in the field of marketing automation, fraud detection, finance and e-commerce. Samuel is Indonesia’s top-ranked Stack Overflow user in R (top 5% worldwide), a certified professional (certificates from Microsoft, MongoDB, Stanford University, John Hopkins University), and an experienced consultant that has worked with several public-trading companies from his time staying in China, Japan and Singapore.

Between 2017 and 2018, Samuel has trained and consulted with more than 20 companies around Indonesia and a regular guest speaker/trainer in a number of universities in Singapore and Indonesia. He is also among the first recipients of Microsoft Professional Program Certificate in Data Science in Southeast Asia, having demonstrated proficiency in R, Python, Microsoft Azure, SQL / T-SQL, PowerBI and a list of other technologies.

Partners

Yellowfin

Yellowfin Suite is a mature, user-friendly BI platform that offers different modules for diverse tasks including data preparation, data discovery, report building and dashboards as well as Signals and Stories. It offers engaging visualization, collaboration and storyboarding features that demonstrate the company’s emphasis and experience in making BI content consumption as easy as possible while reaching as many users as possible.

Single integrated solution developed for companies across varying industries and scaling sizes. Yellowfin also used to transform, access, analyse and report on data held in common business sources including spreadsheets, Web APIs and databases. Now Yellowfin BI delivering insight that matter to 27000+ companies worldwide.

Workshop Receivables:

  • Workshop Lecturer’s Notes

    Including an e-course book (PDF) and/or HTML files.

  • Highly-accelerated Learning

    Learn under the assistance of mentorship of our lead instructor.

  • Quality Learning Environment

    We pay meticulous attention to the logistical details of our workshops: quality audio and visual setups, comfortable sitting arrangements. Snacks are included for evening workshops.

Kickstart Series

Workshops in our Kickstart series are tailored to casual programmers and non-programmers that are taking their first steps into data science. It assumes no prior knowledge or academic background, and attendees will be introduced to the beautiful art of writing R / Python code to produce data visualization and build machine learning models.

Students are encouraged to bring along their laptop and download the course materials beforehand if they wish to follow along with the Code Along exercises. The workshop has a gentle learning slope that is designed with non-technical professionals and academics in mind.

Past Workshops in this Series: