Seminare
Seminare

Data Scientist with Python: Data pipelines with machine learning algorithms and Python – the online training with certificate of completion

Webinar - Haufe Akademie GmbH & Co. KG

Being able to process and analyze data automatically and in real time and derive insights from it is one of the key requirements of companies. Building the data pipelines for this is the task of data scientists - a professional field that is currently in particularly high demand and offers great opportunities. This certified online training course enables you to set up data mining processes, apply machine learning algorithms, create predictive models and implement them productively in automated workflows. The course uses the Python programming language with its leading machine learning libraries. This online course is designed so that you can learn flexibly and at your own pace. You can expect videos, interactive graphics, texts and many practical exercises with extensive data sets and coding tasks. Experienced data analysts are on hand as mentors to answer your questions.
Termin Ort Preis*
09.12.2024- 25.05.2025 online 5.176,50 €
10.02.2025- 10.08.2025 online 5.176,50 €
24.03.2025- 21.09.2025 online 5.176,50 €
05.05.2025- 02.11.2025 online 5.176,50 €
16.06.2025- 14.12.2025 online 5.176,50 €
28.07.2025- 25.01.2026 online 5.176,50 €
08.09.2025- 08.03.2026 online 5.176,50 €
20.10.2025- 19.04.2026 online 5.176,50 €

Alle Termine anzeigen

*Alle Preise verstehen sich inkl. MwSt.

Detaillierte Informationen zum Seminar

Inhalte:

1. Basics of data analytics with Python


  • Working with the Data Lab
  • Basics and concepts in Python
  • Presentation of the tools pandas, matplotlib and Seaborn
  • Database queries with SQL Alchemy


2. Linear algebra


  • Mathematical background
  • Basic concepts of linear algebra
  • Calculation with vectors and matrices
  • Use of the Python library numpy


3. Probability distribution


  • Statistics in data science algorithms
  • Discrete and continuous distributions
  • Versioning code in Git


4. Supervised learning (regression)


  • Concepts of supervised learning
  • Using linear regression
  • Using the Python package sklearn
  • Understanding regression models
  • Evaluation of predictions
  • Bias variance trade-off and regularization
  • Measuring the quality of the model


5. Supervised learning (classification)


  • Introduction to classification algorithms
  • The k-Nearest Neighbors algorithm
  • Assessment of classification performance
  • Optimization of the parameters
  • Splitting the data into training and evaluation sets


6. Unsupervised learning (clustering)


  • Concepts of unsupervised learning
  • The k-Means algorithm
  • Evaluation of the performance metrics
  • Alternatives to k-means clustering


7. Unsupervised learning (dimension reduction)


  • Reducing dimensions in the data view
  • Principal Component Analysis (PCA)
  • Creating uncorrelated features from original data
  • Introduction to feature engineering


8. Identification and excluding outliers


  • Methods for detecting outliers
  • Criteria for unusual data points
  • Robust measures and reducing the influence of outliers


9. Collecting and merging data


  • Reading data from web pages and PDF documents
  • Use of regular expressions
  • Structuring text data before processing


10. Logistic regression


  • Concepts of logistic regression
  • Performance metrics for evaluation
  • Using non-numerical data in models


11. Decision trees and random forests


  • The concept of decision trees
  • Combining multiple models into ensembles
  • Methods for improving the quality of predictions


12. Support Vector Machines


  • Use of Support Vector Machines (SVM)
  • Introduction to Natural Language Processing (NLP)
  • Text classification with bag-of-words models


13. Neural networks


  • Basics of artificial neural networks
  • Basics of deep learning
  • Deeper understanding of neural network layers


14. Visualization and model interpretation


  • Derive and visualize functionalities of models
  • Methods for interpretation and visualization
  • Apply model-agnostic methods


15. Using distributed databases


  • Using the Python package PySpark
  • Reading data from distributed databases
  • Basics of big data analysis
  • Using machine learning algorithms in distributed systems


16. Exercise project


  • Work on a comprehensive exercise project independently
  • Solve a prediction problem using a larger data set
  • Preparation for the final project


17. Final project


  • Independent analysis of the data project
  • Presentation of results and 1:1 feedback session with mentoring team
  • Certificate for Data Scientist with Python
Dauer/zeitlicher Ablauf:
18 weeks (6 h/week)
Ziele/Bildungsabschluss:

In this practice-oriented training course, you will learn how to carry outdata analyses with large data sets independently.

You will learn how to use Python competently, how to use the programming language for data analysis and how to create effective visualizations.

You will learn how to connect different data sources, filter and merge data from them.

You will learn comprehensive methods, algorithms and technologies of machine learning and how to use them with Python packages.

You will learn everything you need to know about the use of deep learning and create an artificial neural network with multiple layers

After the training, you will be able to visualize company data in a meaningful way and make it interactively accessible in dynamic dashboards.

The technical entry hurdles are minimized by the use of Jupyter Notebooks, with which you can carry out the programming exercises directly in the browser.

Zielgruppe:

The online training course to become a Data Scientist with Python is suitable for anyone who wants to learn Python as a programming language and use it to carry out data analyses independently. No special requirements need to be met. The course is also suitable for career changers.

Seminarkennung:
30676
Nach unten
Nach oben
Wir setzen Analyse-Cookies ein, um Ihre Zufriedenheit bei der Nutzung unserer Webseite zu verbessern. Diese Cookies werden nicht automatisiert gesetzt. Wenn Sie mit dem Einsatz dieser Cookies einverstanden sind, klicken Sie bitte auf Akzeptieren. Weitere Informationen finden Sie hier.
Akzeptieren Nicht akzeptieren









Um Spam abzuwehren, geben Sie bitte die Buchstaben auf dem Bild in das Textfeld ein:

captcha



Bei der Verarbeitung Ihrer personenbezogenen Daten im Zusammenhang mit der Kontaktfunktion beachten wir die gesetzlichen Bestimmungen. Unsere ausführlichen Datenschutzinformationen finden Sie hier. Bei der Kontakt-Funktion erhobene Daten werden nur an den jeweiligen Anbieter weitergeleitet und sind nötig, damit der Anbieter auf Ihr Anliegen reagieren kann.







Um Spam abzuwehren, geben Sie bitte die Buchstaben auf dem Bild in das Textfeld ein:

captcha