A Practical Introduction to Data Science Training Course
Participants who complete this training will gain a practical, real-world understanding of Data Science and its related technologies, methodologies and tools.
Participants will have the opportunity to put this knowledge into practice through hands-on exercises. Group interaction and instructor feedback make up an important component of the class.
The course starts with an introduction to elemental concepts of Data Science, then progresses into the tools and methodologies used in Data Science.
Audience
- Developers
- Technical analysts
- IT consultants
Format of the Course
- Part lecture, part discussion, exercises and heavy hands-on practice
Note
- To request a customized training for this course, please contact us to arrange.
Course Outline
Introduction
- The Data Science Process
- Roles and responsibilities of a Data Scientist
Preparing the Development Environment
- Libraries, frameworks, languages and tools
- Local development
- Collaborative web-based development
Data Collection
- Different Types of Data
- Structured
- Local databases
- Database connectors
- Common formats: xlxs, XML, Json, csv, ...
- Un-Structured
- Clicks, censors, smartphones
- APIs
- Internet of Things (IoT)
- Documents, pictures, videos, sounds
- Structured
- Case study: Collecting large amounts of unstructured data continuosly
Data Storage
- Relational databases
- Non-relational databases
- Hadoop: Distributed File System (HDFS)
- Spark: Resilient Distributed Dataset (RDD)
- Cloud storage
Data Preparation
- Ingestion, selection, cleansing, and transformation
- Ensuring data quality - correctness, meaningfulness, and security
- Exception reports
Languages used for Preparation, Processing and Analysis
- R language
- Introduction to R
- Data manipulation, calculation and graphical display
- Python
- Introduction to Python
- Manipulating, processing, cleaning, and crunching data
Data Analytics
- Exploratory analysis
- Basic statistics
- Draft visualizations
- Understand data
- Causality
- Features and transformations
- Machine Learning
- Supervised vs unsurpevised
- When to use what model
- Natural Language Processing (NLP)
Data Visualization
- Best Practices
- Selecting the right chart for the right data
- Color pallets
- Taking it to the next level
- Dashboards
- Interactive Visualizations
- Storytelling with data
Summary and Conclusion
Requirements
- A general understanding of database concepts
- A basic understanding of statistics
Open Training Courses require 5+ participants.
A Practical Introduction to Data Science Training Course - Booking
A Practical Introduction to Data Science Training Course - Enquiry
A Practical Introduction to Data Science - Consultancy Enquiry
Consultancy Enquiry
Testimonials (4)
Hands-on examples allowed us to get an actual feel for how the program works. Good explanations and integration of theoretical concepts and how they relate to practical applications.
Ian - Archeoworks Inc.
Course - ArcGIS Fundamentals
Lab exercise
Tse Kiat - ST Engineering Training & Simulation Systems Pte. Ltd.
Course - Automated Monitoring with Zabbix
All the topics which he covered including examples. And also explained how they are helpful in our daily job.
madduri madduri - Boskalis Singapore Pte Ltd
Course - QGIS for Geographic Information System
The thing I liked the most about the training was the organization and the location
Hamid Tuama - Ability with Innovation General Contracting (DMCC Branch)
Course - ArcGIS for Spatial Analysis
Upcoming Courses
Related Courses
Anaconda Ecosystem for Data Scientists
14 HoursThis instructor-led, live training in Austria (online or onsite) is aimed at data scientists who wish to use the Anaconda ecosystem to capture, manage, and deploy packages and data analysis workflows in a single platform.
By the end of this training, participants will be able to:
- Install and configure Anaconda components and libraries.
- Understand the core concepts, features, and benefits of Anaconda.
- Manage packages, environments, and channels using Anaconda Navigator.
- Use Conda, R, and Python packages for data science and machine learning.
- Get to know some practical use cases and techniques for managing multiple data environments.
ArcGIS for Spatial Analysis
14 HoursThis instructor-led, live training in Austria (online or onsite) is aimed at field ecologists and conservation managers who wish to create data spatial projects in ArcGIS.
By the end of this training, participants will be able to:
- Output spatial data as visualizations.
- Conduct geostatics on actual data.
- Implement spatial data analysis, data processing, and mapping with ArcGIS.
- Analyze spatial data for projects in ArcGIS.
ArcMap in ArcGIS
14 HoursThis instructor-led, live training in Austria (online or onsite) is aimed at intermediate-level environmental scientists and archaeologists who wish to learn how to use ArcMap in ArcGIS for collecting, organizing, managing, and analyzing geographic information.
By the end of this training, participants will be able to:
- Understand the fundamentals of ArcMap and ArcGIS.
- Collect, organize, manage, and analyze geographic information on the social and archaeological elements.
- Conduct spatial queries for impact evaluation.
ArcGIS from Basic to Advanced
35 HoursThis instructor-led, live training in Austria (online or onsite) is aimed at beginner-level to advanced-level GIS professionals and analysts who wish to learn how to effectively use ArcGIS for data visualization, spatial analysis, and geospatial project management.
By the end of this training, participants will be able to:
- Navigate and utilize ArcGIS tools for geospatial data management.
- Create and customize maps with layers and attributes.
- Perform advanced spatial analysis and geoprocessing tasks.
- Automate workflows using ModelBuilder and Python.
ArcGIS Enterprise for Technical Support
14 HoursThis instructor-led, live training in Austria (online or onsite) is aimed at beginner-level IT support personnel who wish to provide robust support for ArcGIS Enterprise, addressing any anomalies or failures effectively.
By the end of this training, participants will be able to:
- Understand the architecture and components of ArcGIS Enterprise.
- Learn to install, configure, and manage ArcGIS Enterprise.
- Gain skills in troubleshooting and resolving common issues.
- Develop proficiency in monitoring and maintaining ArcGIS Enterprise environments.
- Master the techniques for backup, recovery, and performance optimization.
ArcGIS Fundamentals
14 HoursThis instructor-led, live training in Austria (online or onsite) is aimed at beginner-level professionals who wish to learn the fundamental concepts and tools of ArcGIS.
By the end of this training, participants will be able to:
- Understand the basic concepts of GIS and spatial data.
- Navigate the ArcGIS interface.
- Create and manage spatial data.
- Perform basic spatial analysis.
- Create maps and visualizations.
ArcGIS Pro for Spatial Analysis
14 HoursBy the end of this training, participants will be able to:
- Understanding GIS concepts and spatial data types using the latest version of ArcGIS Pro.
- Investigate the user interface and evaluate the uses
- Explore ArcGIS* Pro and how to utilize online content
- Manage, manipulate and analyze your own data
- Display and share your data in a meaningful and creative way
Note: We will use the latest version of ArcGIS Pro.
Advanced ArcGIS Pro for Spatial Analysis
35 HoursThis instructor-led, live training in Austria (online or onsite) is aimed at advanced-level GIS professionals who wish to use ArcGIS Pro to enhance their spatial analysis capabilities, conduct comprehensive geostatistical analysis, and apply advanced 3D modeling techniques for more effective decision-making and problem-solving in real-world scenarios.
By the end of this training, participants will be able to:
- Develop advanced skills in spatial analysis techniques using ArcGIS Pro.
- Utilize Python scripting for automation and complex data processing.
- Apply spatial modeling for problem-solving in real-world scenarios.
- Conduct geostatistical analysis for advanced data interpretation.
- Integrate external data sources and leverage 3D spatial data analysis.
ArcGIS with Python Scripting
14 HoursThis instructor-led, live training in Austria (online or onsite) is aimed at GIS analysts who wish to automate repetitive tasks in GIS processes.
By the end of this training, participants will be able to:
- Build GIS applications using Python and ArcGIS tools.
- Develop with the ArcGIS package ArcPy, using Python.
- Apply the ArcGIS modules for map automation using object classes in Python.
Automated Monitoring with Zabbix
14 HoursThis instructor-led, live training in Austria (online or onsite) covers the installation, planning and configuration of Zabbix, and focuses on practical implementation and tooling.
By the end of this training, participants will be able to:
- Install and configure Zabbix for monitoring IT infrastructure.
- Set up and manage hosts, items, triggers, and actions within Zabbix.
- Utilize Zabbix's features for data collection, alerting, and reporting.
- Integrate Zabbix with other tools and platforms for enhanced monitoring and automation.
Insurtech: A Practical Introduction for Managers
14 HoursInsurtech (a.k.a Digital Insurance) refers to the convergence of insurance + new technologies. In the field of Insurtech "digital insurers" apply technology innovations to their business and operating models in order to reduce costs, improve the customer experience and enhance the agility of their operations.
In this instructor-led training, participants will gain an understanding of the technologies, methods and mindset needed to bring about a digital transformation within their organizations and in the industry at large. The training is aimed at managers who need to gain a big picture understanding, break down the hype and jargon, and take the first steps in establishing an Insurtech strategy.
By the end of this training, participants will be able to:
- Discuss Insurtech and all its component parts intelligently and systematically
- Identify and demystify the role of each key technology within Insurtech.
- Draft a general strategy for implementing Insurtech within their organization
Audience
- Insurers
- Technologists within the insurance industry
- Insurance stakeholders
- Consultants and business analysts
Format of the course
- Part lecture, part discussion, exercises and case study group activities
Kaggle
14 HoursThis instructor-led, live training in Austria (online or onsite) is aimed at data scientists and developers who wish to learn and build their careers in Data Science using Kaggle.
By the end of this training, participants will be able to:
- Learn about data science and machine learning.
- Explore data analytics.
- Learn about Kaggle and how it works.
Accelerating Python Pandas Workflows with Modin
14 HoursThis instructor-led, live training in Austria (online or onsite) is aimed at data scientists and developers who wish to use Modin to build and implement parallel computations with Pandas for faster data analysis.
By the end of this training, participants will be able to:
- Set up the necessary environment to start developing Pandas workflows at scale with Modin.
- Understand the features, architecture, and advantages of Modin.
- Know the differences between Modin, Dask, and Ray.
- Perform Pandas operations faster with Modin.
- Implement the entire Pandas API and functions.
QGIS for Geographic Information System
21 HoursA geographic information system (GIS) is a system designed to capture, store, manipulate, analyze, manage, and present spatial or geographic data. The acronym GIS is sometimes used for geographic information science (GIScience) to refer to the academic discipline that studies geographic information systems and is a large domain within the broader academic discipline of geoinformatics.
QGIS functions as geographic information system (GIS) software, allowing users to analyze and edit spatial information, in addition to composing and exporting graphical maps. QGIS supports both raster and vector layers; vector data is stored as either point, line, or polygon features. Multiple formats of raster images are supported, and the software can georeference images. To summarize it allows the users to Create, edit, visualise, analyse and publish geospatial information on Windows, Mac, Linux, BSD.
This program, in its first phase, introduces the QGIS interface for general usage. In the second phase, we introduce PyQGIS - the python libraries of QGIS that allows the integration of GIS functionalities in your python code or your python application, so that you may even create your own Python Plugin around a particular GIS functionality.
GPU Data Science with NVIDIA RAPIDS
14 HoursThis instructor-led, live training in Austria (online or onsite) is aimed at data scientists and developers who wish to use RAPIDS to build GPU-accelerated data pipelines, workflows, and visualizations, applying machine learning algorithms, such as XGBoost, cuML, etc.
By the end of this training, participants will be able to:
- Set up the necessary development environment to build data models with NVIDIA RAPIDS.
- Understand the features, components, and advantages of RAPIDS.
- Leverage GPUs to accelerate end-to-end data and analytics pipelines.
- Implement GPU-accelerated data preparation and ETL with cuDF and Apache Arrow.
- Learn how to perform machine learning tasks with XGBoost and cuML algorithms.
- Build data visualizations and execute graph analysis with cuXfilter and cuGraph.