Data Science Capstone

  • 4.5
Approx. 6 hours to complete

Course Summary

This course is designed to guide you through the process of completing a data science project from start to finish, including data acquisition and cleaning, exploratory data analysis, data preprocessing, model building, and deployment.

Key Learning Points

  • Learn how to complete a data science project from start to finish
  • Explore data acquisition, cleaning, preprocessing, and modeling techniques
  • Deploy your model and communicate your findings effectively

Related Topics for further study


Learning Outcomes

  • Complete a data science project from start to finish
  • Apply various data acquisition, cleaning, preprocessing, and modeling techniques
  • Deploy and communicate your findings effectively

Prerequisites or good to have knowledge before taking this course

  • Basic programming knowledge in Python
  • Familiarity with data analysis concepts

Course Difficulty Level

Intermediate

Course Format

  • Online Self-Paced
  • Video Lectures
  • Hands-On Projects

Similar Courses

  • Applied Data Science with Python
  • Data Science Essentials
  • Statistics with Python

Related Education Paths


Related Books

Description

The capstone project class will allow students to create a usable/public data product that can be used to show your skills to potential employers. Projects will be drawn from real-world problems and will be conducted with industry, government, and academic partners.

Knowledge

  • Create a useful data product for the public
  • Apply your exploratory data analysis skills
  • Build an efficient and accurate prediction model
  • Produce a presentation deck to showcase your findings

Outline

  • Overview, Understanding the Problem, and Getting the Data
  • Welcome to the Capstone Project
  • Welcome from SwiftKey
  • You Are a Data Scientist Now
  • Introduction to Task 0: Understanding the Problem
  • Introduction to Task 1: Getting and Cleaning the Data
  • Regular Expressions: Part 1 (Optional)
  • Regular Expressions: Part 2 (Optional)
  • A Note of Explanation
  • Project Overview
  • Syllabus
  • Task 0 - Understanding the problem
  • About the Corpora
  • Task 1 - Getting and cleaning the data
  • Quiz 1: Getting Started
  • Exploratory Data Analysis and Modeling
  • Introduction to Task 2: Exploratory Data Analysis
  • Introduction to Task 3: Modeling
  • Task 2 - Exploratory Data Analysis
  • Task 3 - Modeling
  • Prediction Model
  • Introduction to Task 4: Prediction Model
  • Task 4 - Prediction Model
  • Quiz 2: Natural language processing I
  • Creative Exploration
  • Introduction to Task 5: Creative Exploration
  • Task 5 - Creative Exploration
  • Quiz 3: Natural language processing II
  • Data Product
  • Introduction to Task 6: Data Product
  • Task 6 - Data Product
  • Slide Deck
  • Introduction to Task 7: Slide Deck
  • Task 7 - Slide Deck
  • Final Project Submission and Evaluation
  • Congratulations!

Summary of User Reviews

Discover the world of data science with this comprehensive project-based course. Users have rated this course highly for its engaging content and practical approach to learning.

Key Aspect Users Liked About This Course

The practical approach to learning is highly appreciated by many users.

Pros from User Reviews

  • Engaging and practical content
  • Great for beginners in data science
  • Clear and concise explanations
  • Excellent guidance from instructors
  • Real-world applications and case studies

Cons from User Reviews

  • Some users found the course to be too basic
  • The pacing of the course could be improved
  • Not enough emphasis on advanced techniques
  • Limited interaction with instructors
  • Some technical issues with the platform
English
Available now
Approx. 6 hours to complete
Jeff Leek, PhD, Roger D. Peng, PhD, Brian Caffo, PhD
Johns Hopkins University
Coursera

Instructor

Jeff Leek, PhD

  • 4.5 Raiting
Share
Saved Course list
Cancel
Get Course Update
Computer Courses