Data Engineering is a software engineering practice with focus on design, development, and the productionizing of data processing systems. It includes all the practical aspects of data acquisition, transfer, transformation, and storage on-prem or in the cloud.
This Data Engineering with Python training course provides skills to apply Python to the practical aspects of data engineering and introduces to the popular Python libraries used in the field, including NumPy, pandas, Matplotlib, scikit-learn, and Apache Spark.
By attending Data Engineering with Python workshop, delegates will learn:
- Data engineering practice
- High-octane introduction to Python
- Technical reviews of NumPy, pandas, and other Python libraries and data processing systems
- Data visualization and exploratory data analysis
- Data repairing and normalization
- Understanding the data needs and requirements of Machine Learning and Data Science projects
- Python in the Cloud
- Python on Hadoop (PySpark)
- Practical experience coding in one or more modern programming languages. Knowledge of Python is desirable but not necessary.
- Developers, Software Engineers, Data Scientists, and IT Architects
