- Online Degree Explore Bachelor’s & Master’s degrees
- MasterTrack™ Earn credit towards a Master’s degree
- University Certificates Advance your career with graduate-level learning
- Top Courses
- Join for Free

Using SQL for Data Science, Part 1

4.2 (187 ratings)
30K Students Enrolled
Course 4 of 4 in the Learn SQL Basics for Data Science Specialization
This Course
Video Transcript
Data science is a dynamic and growing career field, and success in it demands knowledge of and skill in SQL. This course is designed to provide you with a solid foundation in applying SQL skills to analyze data and solve real business problems. Whether you have successfully completed the other courses in the Learn SQL Basics for Data Science Specialization or are taking just this course, this project is your chance to apply the knowledge and skills you have acquired to practice important SQL querying and solve problems with data. You will work through your own personal or professional project to create a portfolio-worthy piece from start to finish. You will choose a dataset and develop a project proposal. You will explore your data and compute some of the initial statistics you have learned in this specialization. You will uncover analytics for qualitative data and consider new metrics that make sense from the patterns that surface in your analysis. You will put all of your work together in the form of a presentation where you will tell the story of your findings. Along the way, you will receive feedback through the peer-review process. This community of fellow learners will provide additional input to help you refine your approach to data analysis with SQL and present your findings to clients and management.
Skills You'll Learn
Presentation Skills, Data Analysis, SQL, creating metrics, Exploratory Data Analysis
- 5 stars 61.49%
- 4 stars 16.04%
- 3 stars 8.55%
- 2 stars 6.95%
- 1 star 6.95%
Oct 12, 2021
This was a great course. It taught me more about SQL in one month than a semester at a top 20 university.
Nov 25, 2021
This guided project was a nice end to the SQL Basics specialization.
From the lesson
Milestone 4: Presenting Your Findings (Storytelling)
In this milestone, you will present your findings. You will identify your audience and create a presentation tailored to them. You will be able to tell the story of analyses and make recommendations.

Research Data Scientist for Deep Insights AI at Intel
Explore our Catalog
Join for free and get personalized recommendations, updates and offers.
Coursera Footer
Learn something new.
- Learn a Language
- Learn Accounting
- Learn Coding
- Learn Copywriting
- Learn Public Relations
- Boulder MS Data Science
- Illinois iMBA
- Illinois MS Computer Science
- UMich MS in Applied Data Science
Popular Topics
- Cybersecurity
- Data Analysis
- Data Science
- Machine Learning
- Project Management
Popular Certificates
- Google Data Analytics
- Google Digital Marketing & Ecommerce
- Google IT Automation with Python
- Google IT Support
- Google Project Management
- Google UX Design
- IBM Data Analyst
- IBM Data Science
- Intuit Bookkeeping
- Meta Front-End Developer
Featured Articles
- A Comprehensive Guide to Becoming a Data Analyst
- Advance Your Career With A Cybersecurity Certification
- Get Your Data Analytics Certification
- How to Break into the Field of Data Analysis
- Jumpstart Your Data Career with a SQL Certification
- Learn How to Become PMP Certified
- Start Your Career with CAPM Certification
- Understanding the Role and Responsibilities of a Scrum Master
- Unlock Your Potential with a PMI Certification
- What You Should Know About CompTIA A+ Certification
- What We Offer
- Coursera Plus
- Professional Certificates
- MasterTrack® Certificates
- For Enterprise
- For Government
- Become a Partner
- Coronavirus Response
- Free Courses
- All Courses
- Beta Testers
- Translators
- Teaching Center
- Accessibility
- Modern Slavery Statement

Course Introduction and Welcome

From the lesson
Getting Started and Milestone 1: Project Proposal and Data Selection/Preparation
In this first milestone, you will select your client and import your dataset. You will begin to explore your data to understand it and make assumptions about your data. You will draft a project proposal to act as a guide as you explore your data and prove or disprove your hypotheses.
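The import-and-explore step described above can be sketched as follows. This is a minimal illustration, not the course's own workflow: the `orders` table, its columns, the sample rows, and the `load_csv` helper are all hypothetical, and an in-memory SQLite database stands in for whatever environment you actually use.

```python
import csv
import io
import sqlite3

# Hypothetical sample data standing in for a real CSV file.
SAMPLE_CSV = """order_id,region,amount
1,West,120.50
2,East,80.00
3,West,
4,South,42.25
"""


def load_csv(conn, table, text):
    """Create `table` from CSV text and return the number of rows loaded."""
    reader = csv.reader(io.StringIO(text))
    header = next(reader)
    cols = ", ".join(f'"{name}"' for name in header)
    marks = ", ".join("?" for _ in header)
    conn.execute(f'CREATE TABLE "{table}" ({cols})')
    # Store empty CSV fields as NULL so they show up in null counts.
    rows = [[value if value != "" else None for value in row] for row in reader]
    conn.executemany(f'INSERT INTO "{table}" VALUES ({marks})', rows)
    return len(rows)


conn = sqlite3.connect(":memory:")
row_count = load_csv(conn, "orders", SAMPLE_CSV)

# A first profiling query: size, cardinality, missing values, and a mean.
profile = conn.execute(
    """
    SELECT COUNT(*)                            AS total_rows,
           COUNT(DISTINCT region)              AS regions,
           SUM(amount IS NULL)                 AS missing_amounts,
           ROUND(AVG(CAST(amount AS REAL)), 2) AS avg_amount
    FROM orders
    """
).fetchone()
print(profile)
```

Counting rows, distinct values, and missing values before anything else is one way to surface the assumptions a project proposal has to state, such as how blank amounts should be treated.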


Capstone Project for Databases and SQL for Data Science Specialization from Coursera
Mega-Barrel/SQL-DataScience-Capstone
SQL-DataScience-Capstone
- Jupyter Notebook 100.0%
SQL in Notebooks

From the lesson
Getting Started and Milestone 1: Project Proposal and Data Selection/Preparation
In this first milestone, you will select your client and import your dataset. You will begin to explore your data to understand it and make assumptions about your data. You will draft a project proposal to act as a guide as you explore your data and prove or disprove your hypotheses.


Learn SQL Basics for Data Science Specialization

Financial aid available
What you will learn
Use SQL commands to filter, sort, & summarize data; manipulate strings, dates, & numerical data from different sources for analysis
Assess and create datasets to solve your business questions and problems using SQL
Use the collaborative Databricks workspace and create an end-to-end pipeline that reads data, transforms it, and saves the result
Develop a project proposal & select your data, perform statistical analysis & develop metrics, and present your findings & make recommendations

Skills you will gain
- Data Analysis
- Apache Spark
- Data Science
- A/B Testing
- Query String
- Predictive Analytics
- Presentation Skills
- creating metrics
- Exploratory Data Analysis
About this Specialization
No prior experience required.
How the Specialization Works
Take courses.
A Coursera Specialization is a series of courses that helps you master a skill. To begin, enroll in the Specialization directly, or review its courses and choose the one you'd like to start with. When you subscribe to a course that is part of a Specialization, you’re automatically subscribed to the full Specialization. It’s okay to complete just one course — you can pause your learning or end your subscription at any time. Visit your learner dashboard to track your course enrollments and your progress.
Hands-on Project
Every Specialization includes a hands-on project. You'll need to successfully finish the project(s) to complete the Specialization and earn your certificate. If the Specialization includes a separate course for the hands-on project, you'll need to finish each of the other courses before you can start it.
Earn a Certificate
When you finish every course and complete the hands-on project, you'll earn a Certificate that you can share with prospective employers and your professional network.

There are 4 Courses in this Specialization
SQL for Data Science
As data collection has increased exponentially, so has the need for people who are skilled at working with data, who can think critically, and who can provide insights that lead to better decisions and optimized businesses. Such a person is a data scientist: “part mathematician, part computer scientist, and part trend spotter” (SAS Institute, Inc.). According to Glassdoor, data scientist is the best job in America, with a median base salary of $110,000 and thousands of job openings at a time. Being a good data scientist requires being able to retrieve and work with data, and to do that you need to be well versed in SQL, the standard language for communicating with database systems.
This course is designed to give you a primer in the fundamentals of SQL and working with data so that you can begin analyzing it for data science purposes. You will begin to ask the right questions and come up with good answers to deliver valuable insights for your organization. This course starts with the basics and assumes you do not have any knowledge or skills in SQL. It will build on that foundation and gradually have you write both simple and complex queries to help you select data from tables. You'll start to work with different types of data like strings and numbers and discuss methods to filter and pare down your results. You will create new tables and be able to move data into them. You will learn common operators and how to combine the data. You will use case statements and concepts like data governance and profiling. You will discuss topics on data, and practice using real-world programming assignments. You will interpret the structure, meaning, and relationships in source data and use SQL as a professional to shape your data for targeted analysis purposes. Although we do not have any specific prerequisites or software requirements to take this course, a simple text editor is recommended for the final project. So what are you waiting for? This is your first step in landing a job in the best occupation in the US and soon the world!
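As a rough illustration of the query-building skills listed above (creating tables, moving data into them, filtering, combining tables, and case statements), here is a minimal sketch run through Python's built-in `sqlite3` module. The `customers` and `orders` tables, their rows, and the spend thresholds are hypothetical and chosen only to keep the example small.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# Create two small tables and move some hypothetical rows into them.
conn.executescript(
    """
    CREATE TABLE customers (id INTEGER PRIMARY KEY, name TEXT, city TEXT);
    CREATE TABLE orders (id INTEGER PRIMARY KEY, customer_id INTEGER, total REAL);
    INSERT INTO customers VALUES (1, 'Ana', 'Davis'), (2, 'Ben', 'Boulder'), (3, 'Cy', 'Davis');
    INSERT INTO orders VALUES (1, 1, 250.0), (2, 1, 40.0), (3, 2, 15.0);
    """
)

# Filter with WHERE, combine tables with a LEFT JOIN, summarize with SUM,
# and label each customer with a CASE expression.
query = """
    SELECT c.name,
           COALESCE(SUM(o.total), 0) AS spend,
           CASE
               WHEN SUM(o.total) >= 100 THEN 'high'
               WHEN SUM(o.total) > 0 THEN 'low'
               ELSE 'none'
           END AS segment
    FROM customers AS c
    LEFT JOIN orders AS o ON o.customer_id = c.id
    WHERE c.city IN ('Davis', 'Boulder')
    GROUP BY c.id, c.name
    ORDER BY spend DESC
"""
results = conn.execute(query).fetchall()
for row in results:
    print(row)
```

The `LEFT JOIN` keeps customers who have no orders, which is why the `CASE` expression needs an `ELSE` branch: for those customers `SUM(o.total)` is NULL and neither comparison is true.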
Data Wrangling, Analysis and AB Testing with SQL
This course allows you to apply the SQL skills taught in “SQL for Data Science” to four increasingly complex and authentic data science inquiry case studies. We'll learn how to convert timestamps of all types to common formats and perform date/time calculations. We'll select and perform the optimal JOIN for a data science inquiry and clean data within an analysis dataset by deduping, running quality checks, backfilling, and handling nulls. We'll learn how to segment and analyze data per segment using windowing functions and use case statements to execute conditional logic to address a data science inquiry. We'll also describe how to convert a query into a scheduled job and how to insert data into a date partition. Finally, given a predictive analysis need, we'll engineer a feature from raw data using the tools and skills we've built over the course. The real-world application of these skills will give you the framework for performing the analysis of an AB test.
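The cleaning steps named above (deduping, handling nulls, and analyzing data per segment with windowing functions) can be sketched as below. This is an assumption-laden illustration rather than the course's case study: the `events` table, its columns, and the A/B `variant` labels are hypothetical, and the window function requires SQLite 3.25 or newer, which recent Python builds normally bundle.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# Hypothetical A/B test events, including one duplicate and one NULL.
conn.executescript(
    """
    CREATE TABLE events (user_id INTEGER, event_date TEXT, variant TEXT, revenue REAL);
    INSERT INTO events VALUES
        (1, '2021-10-01', 'A', 10.0),
        (1, '2021-10-01', 'A', 10.0),   -- exact duplicate row
        (2, '2021-10-01', 'B', NULL),   -- missing revenue
        (3, '2021-10-02', 'B', 30.0),
        (4, '2021-10-02', 'A', 5.0);
    """
)

query = """
    WITH ranked AS (
        -- Number identical rows so that only the first copy is kept.
        SELECT *,
               ROW_NUMBER() OVER (
                   PARTITION BY user_id, event_date, variant, revenue
                   ORDER BY user_id
               ) AS copy_number
        FROM events
    ),
    clean AS (
        -- Drop duplicates and treat missing revenue as zero (an assumption).
        SELECT user_id, event_date, variant, COALESCE(revenue, 0) AS revenue
        FROM ranked
        WHERE copy_number = 1
    )
    SELECT variant,
           COUNT(*)               AS users,
           SUM(revenue)           AS revenue,
           ROUND(AVG(revenue), 2) AS revenue_per_user
    FROM clean
    GROUP BY variant
    ORDER BY variant
"""
summary = conn.execute(query).fetchall()
for row in summary:
    print(row)
```

Whether a NULL should become zero or be excluded from the average is an analysis decision; `COALESCE(revenue, 0)` is used here only to show where that choice is made.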
Distributed Computing with Spark SQL
This course is all about big data. It’s for students with SQL experience who want to take the next step on their data journey by learning distributed computing using Apache Spark. Students will gain a thorough understanding of this open-source standard for working with large datasets, along with the fundamentals of data analysis using SQL on Spark, setting the foundation for how to combine data with advanced analytics at scale and in production environments. The four modules build on one another, and by the end of the course you will understand the Spark architecture, queries within Spark, common ways to optimize Spark SQL, and how to build reliable data pipelines.
The first module introduces Spark and the Databricks environment including how Spark distributes computation and Spark SQL. Module 2 covers the core concepts of Spark such as storage vs. compute, caching, partitions, and troubleshooting performance issues via the Spark UI. It also covers new features in Apache Spark 3.x such as Adaptive Query Execution. The third module focuses on Engineering Data Pipelines including connecting to databases, schemas and data types, file formats, and writing reliable data. The final module covers data lakes, data warehouses, and lakehouses. Students build production grade data pipelines by combining Spark with the open-source project Delta Lake. By the end of this course, students will hone their SQL and distributed computing skills to become more adept at advanced analysis and to set the stage for transitioning to more advanced analytics as Data Scientists.
SQL for Data Science Capstone Project
Data science is a dynamic and growing career field, and success in it demands knowledge of and skill in SQL. This course is designed to provide you with a solid foundation in applying SQL skills to analyze data and solve real business problems.
Whether you have successfully completed the other courses in the Learn SQL Basics for Data Science Specialization or are taking just this course, this project is your chance to apply the knowledge and skills you have acquired to practice important SQL querying and solve problems with data. You will participate in your own personal or professional journey to create a portfolio-worthy piece from start to finish. You will choose a dataset and develop a project proposal. You will explore your data and perform some initial statistics you have learned through this specialization. You will uncover analytics for qualitative data and consider new metrics that make sense from the patterns that surface in your analysis. You will put all of your work together in the form of a presentation where you will tell the story of your findings. Along the way, you will receive feedback through the peer-review process. This community of fellow learners will provide additional input to help you refine your approach to data analysis with SQL and present your findings to clients and management.
Instructors

Sadie St. Lawrence

Katrina Glaeser Poole

Brooke Wenig

Conor Murphy

University of California, Davis
UC Davis, one of the nation’s top-ranked research universities, is a global leader in agriculture, veterinary medicine, sustainability, environmental and biological sciences, and technology. With four colleges and six professional schools, UC Davis and its students and alumni are known for their academic excellence, meaningful public service and profound international impact.
Frequently Asked Questions
What is the refund policy?
If you subscribed, you get a 7-day free trial during which you can cancel at no penalty. After that, we don’t give refunds, but you can cancel your subscription at any time. See our full refund policy.
Can I just enroll in a single course?
Yes! To get started, click the course card that interests you and enroll. You can enroll and complete the course to earn a shareable certificate, or you can audit it to view the course materials for free. When you subscribe to a course that is part of a Specialization, you’re automatically subscribed to the full Specialization. Visit your learner dashboard to track your progress.
Is financial aid available?
Yes. In select learning programs, you can apply for financial aid or a scholarship if you can’t afford the enrollment fee. If financial aid or a scholarship is available for your learning program selection, you’ll find a link to apply on the description page.
Can I take the course for free?
When you enroll in the course, you get access to all of the courses in the Specialization, and you earn a certificate when you complete the work. If you only want to read and view the course content, you can audit the course for free. If you cannot afford the fee, you can apply for financial aid.
Is this course really 100% online? Do I need to attend any classes in person?
This course is completely online, so there’s no need to show up to a classroom in person. You can access your lectures, readings and assignments anytime and anywhere via the web or your mobile device.
Will I earn university credit for completing the Specialization?
This Specialization doesn't carry university credit, but some universities may choose to accept Specialization Certificates for credit. Check with your institution to learn more.
How long does it take to complete the Specialization?
This Specialization consists of 4 courses that could take anywhere from 4 to 6 months to complete.
What background knowledge is necessary?
This Specialization is intended for learners with no prior knowledge of SQL; the courses build your SQL skills progressively.
Do I need to take the courses in a specific order?
We absolutely recommend you take the first course listed first and the Capstone project last, but courses two and three can be completed in either order.
More questions? Visit the Learner Help Center.


SQL for Data Science Capstone Project

Description
Data science is a dynamic and growing career field, and success in it demands knowledge of and skill in SQL. This course is designed to provide you with a solid foundation in applying SQL skills to analyze data and solve real business problems.
This resource is offered by an affiliate partner. If you pay for training, we may earn a commission to support this site.
Career Relevance by Data Role
The techniques and tools covered in SQL for Data Science Capstone Project are most similar to the requirements found in Business Analyst job advertisements.
Similar Opportunities
Data Wrangling, Analysis and AB Testing with SQL
Coursera - University of California, Davis
Analyze Data to Answer Questions
Intermediate SQL Server
Data Analysis in Social Science - Assessing Your Knowledge
edX - Massachusetts Institute of Technology
How to Transform Tables with SQL
Learn How to Analyze Business Metrics with SQL
SQL for Data Analysis
SQL Analysis for Data Developers
SQL for Data Science

COMMENTS
29,594 recent views.
Capstone project: Twitter activity of the US members of Congress. 🔷 This repo contains the requested work for my final project of SQL for Data Science Capstone Project by UCDavis/Coursera. 🔷 The full analysis and Python code can be viewed in the dedicated Jupyter notebook. However, it's usually better to see it with the Nbviewer service.