Search our courses
Training

This Data Wrangling with Python training course starts with the absolute basics of Python, focusing mainly on data structures. It then delves into the fundamental tools of data wrangling like NumPy and Pandas libraries. You’ll explore useful insights into why you should stay away from traditional ways of data cleaning, as done in other languages, and take advantage of the specialized pre-built routines in Python. This combination of Python tips and tricks will also demonstrate how to use the same Python backend and extract/transform data from an array of sources including the Internet, large database vaults, and Excel financial tables. To help you prepare for more challenging scenarios, you’ll cover how to handle missing or wrong data, and reformat it based on the requirements from the downstream analytics tool.  The course will further help you grasp concepts through real-world examples and datasets.

LEARNING OUTCOMES

  • Use and manipulate complex and simple data structures
  • Harness the full potential of DataFrames and numpy.array at run time
  • Perform web scraping with BeautifulSoup4 and html5lib
  • Execute advanced string search and manipulation with RegEX
  • Handle outliers and perform data imputation with Pandas
  • Use descriptive statistics and plotting techniques
  • Practice data wrangling and modeling using data generation techniques

Data Wrangling with Python

Course Code

GTDDWP

Duration

2 Days

Course Fee

POA

Accreditation

N/A

Target Audience

  • Data Wrangling with Python is designed for developers, data analysts, and business analysts who are keen to pursue a career as a full-fledged data scientist or analytics expert. Although, this course is for beginners, prior working knowledge of Python is necessary to easily grasp the concepts covered here. It will also help to have rudimentary knowledge of relational database and SQL.

Expand all

Course Description

This Data Wrangling with Python training course starts with the absolute basics of Python, focusing mainly on data structures. It then delves into the fundamental tools of data wrangling like NumPy and Pandas libraries. You’ll explore useful insights into why you should stay away from traditional ways of data cleaning, as done in other languages, and take advantage of the specialized pre-built routines in Python. This combination of Python tips and tricks will also demonstrate how to use the same Python backend and extract/transform data from an array of sources including the Internet, large database vaults, and Excel financial tables. To help you prepare for more challenging scenarios, you’ll cover how to handle missing or wrong data, and reformat it based on the requirements from the downstream analytics tool.  The course will further help you grasp concepts through real-world examples and datasets.

LEARNING OUTCOMES

  • Use and manipulate complex and simple data structures
  • Harness the full potential of DataFrames and numpy.array at run time
  • Perform web scraping with BeautifulSoup4 and html5lib
  • Execute advanced string search and manipulation with RegEX
  • Handle outliers and perform data imputation with Pandas
  • Use descriptive statistics and plotting techniques
  • Practice data wrangling and modeling using data generation techniques
Course Outline

Lesson 1: Introduction to Data Structure using Python

  • Python for Data Wrangling
  • Lists, Sets, Strings, Tuples, and Dictionaries

Lesson 2: Advanced Operations on Built-In Data Structure

  • Advanced Data Structures
  • Basic File Operations in Python

Lesson 3: Introduction to NumPy, Pandas, and Matplotlib

  • NumPy Arrays
  • Pandas DataFrames
  • Statistics and Visualization with NumPy and Pandas
  • Using NumPy and Pandas to Calculate Basic Descriptive
  • Statistics on the DataFrame

Lesson 4: Deep Dive into Data Wrangling with Python

  • Subsetting, Filtering, and Grouping
  • Detecting Outliers and Handling Missing Values
  • Concatenating, Merging, and Joining
  • Useful Methods of Pandas

Lesson 5: Get Comfortable with a Different Kind of Data Sources

  • Reading Data from Different Text-Based (and Non-Text-Based) Sources
  • Introduction to BeautifulSoup4 and Web Page Parsing

Lesson 6: Learning the Hidden Secrets of Data Wrangling

  • Advanced List Comprehension and the zip Function
  • Data Formatting

Lesson 7: Advanced Web Scraping and Data Gathering

  • Basics of Web Scraping and BeautifulSoup libraries
  • Reading Data from XML

Lesson 8: RDBMS and SQL

  • Refresher of RDBMS and SQL
  • Using an RDBMS (MySQL/PostgreSQL/SQLite)

Lesson 9: Application in real life and Conclusion of course

  • Applying Your Knowledge to a Real-life Data Wrangling
  • Task
  • An Extension to Data Wrangling
Learning Path
Ways to Attend
  • Attend a public course, if there is one available. Please check our schedule, or register your interest in joining a course in your area.
  • Private onsite Team training also available, please contact us to discuss. We can customise this course to suit your business requirements.

Private Team Training is available for this course

We deliver this course either on or off-site in various regions around the world, and can customise your delivery to suit your exact business needs. Talk to us about how we can fine-tune a course to suit your team's current skillset and ultimate learning objectives.

Private Team Training | Contact us

Technical ICT learning & mentoring services

Private Team Training

Our instructors are specialist consultants with vast real world experience and expertise allowing them to design and deliver client-focused courses for your organisation.

Learn more about our Private Team Training

What Our Clients Say

“I particularly liked the heavy hands on sessions that went on with the training. Other than that, really liked Mark's training style. His experience in the field really shines through.”

 

Docker - GTDK1

Feb ‘19

“Instructor's ability to demonstrate new features that are not part of the course help show his mastery as well as prepare us for changes in the technology. Great work.

 

Using Docker & Kubernetes in Production - GTK8SG

Oct ‘18


“This course was an excellent insight into the Cloud Service Management world and equips me with the tools to go back to my company and build upon it.”

 

Cloud Service Manager - GTC13

Jan ‘19

 

''Fantastic course, looking forward to applying this in my work and home life. Excellent, practical approach, very motivational. I think the entire company should attend training.''

 

Being Agile in Business - GTBAB

Sept '19

“Excellent instructor. You can tell he really understands the concepts he's presenting and is very passionate about his work. He answered every question we asked and presented the course in an interesting and involving manner.”

 

Spring Boot Development - GTIT40

Nov ‘18

"Intelligence is the ability to avoid doing work, yet
getting the work done"

Linus Torvalds, creator of Linux and GIT

Technical ICT learning & mentoring services

About GuruTeam

GuruTeam is a high-level ICT Learning, Mentoring and Consultancy services company. We specialise in delivering instructor-led on and off-site training in Blockchain, Linux, Cloud, Big Data, DevOps, Kubernetes, Agile, Software & Web Development technologies. View our Testimonials

Download our eBrochure
Our Accreditation Partners
  •  
  •  
  •  
  •  

 

Upcoming Courses

Kubernetes Administration

18th - 21st August - Live Online

12th - 15th October - Live Online

This Kubernetes Administration Certification training course is suitable for anyone who wants to learn the skills necessary to build and administer a Kubernetes cluster

Learn More

CompTIA Network+ FastTrack

Coming Soon

This fast-paced course teaches the essentials of networking and helps to prepare the student for the CompTIA Network+ certification.

Learn More

Applied Data Science and Big Data Analytics

Coming Soon

Learn about the theoretical and practical aspects of using Python in the realm of Data Science, Business Analytics, and Data Logistics

Learn More

Introduction to Python 3

8th - 10th September - Live Online

29th September - 1st October-  Live Online

20th - 22nd October  -  Live Online

  10th - 12th November -  Live Online

                                                                                        24th - 26th November -  Live Online

                                                                                         15th - 17th December -  Live Online

Python is a powerful and popular object-oriented programming/scripting language with many high quality libraries.

Learn More

Newsletter

Stay up to date, receive updates on scheduled dates, new courses, offers, and events.

Subscribe to our Newsletter