Using SQL for Data Science, Part 1

4.2 (187 ratings)

30K Students Enrolled

Course 4 of 4 in the Learn SQL Basics for Data Science Specialization

Data science is a dynamic and growing career field that demands SQL knowledge and skills to be successful. This course is designed to provide you with a solid foundation in applying SQL skills to analyze data and solve real business problems. Whether you have successfully completed the other courses in the Learn SQL Basics for Data Science Specialization or are taking just this course, this project is your chance to apply the knowledge and skills you have acquired to practice important SQL querying and solve problems with data. You will participate in your own personal or professional journey to create a portfolio-worthy piece from start to finish. You will choose a dataset and develop a project proposal. You will explore your data and compute some of the initial statistics you have learned through this specialization. You will uncover analytics for qualitative data and consider new metrics that make sense from the patterns that surface in your analysis. You will put all of your work together in the form of a presentation where you will tell the story of your findings. Along the way, you will receive feedback through the peer-review process. This community of fellow learners will provide additional input to help you refine your approach to data analysis with SQL and present your findings to clients and management.

Skills You'll Learn

Presentation Skills, Data Analysis, SQL, creating metrics, Exploratory Data Analysis

Oct 12, 2021

This was a great course. It taught me more about SQL in one month than a semester at a top 20 university.

Nov 25, 2021

This guided project was a nice end to the SQL Basics specialization.

From the lesson

Milestone 4: Presenting Your Findings (Storytelling)

In this milestone, you will present your findings. You will identify your audience and create a presentation tailored to them. You will be able to tell the story of analyses and make recommendations.

Research Data Scientist for Deep Insights AI at Intel

Course Introduction and Welcome

From the lesson

Getting Started and Milestone 1: Project Proposal and Data Selection/Preparation

In this first milestone, you will select your client and import your dataset. You will begin to explore your data to understand it and make assumptions about your data. You will draft a project proposal to act as a guide as you explore your data and prove or disprove your hypotheses.

Capstone Project for Databases and SQL for Data Science Specialization from Coursera

Mega-Barrel/SQL-DataScience-Capstone

SQL in Notebooks

From the lesson

Getting Started and Milestone 1: Project Proposal and Data Selection/Preparation

In this first milestone, you will select your client and import your dataset. You will begin to explore your data to understand it and make assumptions about your data. You will draft a project proposal to act as a guide as you explore your data and prove or disprove your hypotheses.

Learn SQL Basics for Data Science Specialization

Financial aid available

What you will learn

Use SQL commands to filter, sort, & summarize data; manipulate strings, dates, & numerical data from different sources for analysis

Assess and create datasets to solve your business questions and problems using SQL

Use the collaborative Databricks workspace and create an end-to-end pipeline that reads data, transforms it, and saves the result

Develop a project proposal & select your data, perform statistical analysis & develop metrics, and present your findings & make recommendations

Skills you will gain

About this Specialization

No prior experience required.

How the Specialization Works

Take courses.

A Coursera Specialization is a series of courses that helps you master a skill. To begin, enroll in the Specialization directly, or review its courses and choose the one you'd like to start with. When you subscribe to a course that is part of a Specialization, you’re automatically subscribed to the full Specialization. It’s okay to complete just one course — you can pause your learning or end your subscription at any time. Visit your learner dashboard to track your course enrollments and your progress.

Hands-on Project

Every Specialization includes a hands-on project. You'll need to successfully finish the project(s) to complete the Specialization and earn your certificate. If the Specialization includes a separate course for the hands-on project, you'll need to finish each of the other courses before you can start it.

Earn a Certificate

When you finish every course and complete the hands-on project, you'll earn a Certificate that you can share with prospective employers and your professional network.

There are 4 Courses in this Specialization

SQL for Data Science

As data collection has increased exponentially, so has the need for people skilled at using and interacting with data, who can think critically and provide insights to make better decisions and optimize their businesses. This is the data scientist: “part mathematician, part computer scientist, and part trend spotter” (SAS Institute, Inc.). According to Glassdoor, data scientist is the best job in America, with a median base salary of $110,000 and thousands of job openings at a time. The skills necessary to be a good data scientist include being able to retrieve and work with data, and to do that you need to be well versed in SQL, the standard language for communicating with database systems.

This course is designed to give you a primer in the fundamentals of SQL and working with data so that you can begin analyzing it for data science purposes. You will begin to ask the right questions and come up with good answers to deliver valuable insights for your organization. This course starts with the basics and assumes you do not have any knowledge or skills in SQL. It will build on that foundation and gradually have you write both simple and complex queries to help you select data from tables. You'll start to work with different types of data like strings and numbers and discuss methods to filter and pare down your results. You will create new tables and be able to move data into them. You will learn common operators and how to combine the data. You will use case statements and concepts like data governance and profiling. You will discuss topics on data, and practice using real-world programming assignments. You will interpret the structure, meaning, and relationships in source data and use SQL as a professional to shape your data for targeted analysis purposes. Although we do not have any specific prerequisites or software requirements to take this course, a simple text editor is recommended for the final project. So what are you waiting for? This is your first step in landing a job in the best occupation in the US and soon the world!

Data Wrangling, Analysis and AB Testing with SQL

This course allows you to apply the SQL skills taught in “SQL for Data Science” to four increasingly complex and authentic data science inquiry case studies. We'll learn how to convert timestamps of all types to common formats and perform date/time calculations. We'll select and perform the optimal JOIN for a data science inquiry and clean data within an analysis dataset by deduping, running quality checks, backfilling, and handling nulls. We'll learn how to segment and analyze data per segment using windowing functions and use case statements to execute conditional logic to address a data science inquiry. We'll also describe how to convert a query into a scheduled job and how to insert data into a date partition. Finally, given a predictive analysis need, we'll engineer a feature from raw data using the tools and skills we've built over the course. The real-world application of these skills will give you the framework for performing the analysis of an AB test.
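As a rough illustration of the cleaning steps named above (deduping, handling nulls, and conditional logic with a CASE statement), here is a minimal sketch using Python's built-in sqlite3 module. The `events` table, its columns, and the 10-unit threshold are hypothetical and chosen only for this example; they do not come from the course materials.

```python
import sqlite3

# Hypothetical in-memory table; names and values are assumptions for illustration.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE events (user_id INTEGER, plan TEXT, amount REAL);
    INSERT INTO events VALUES
        (1, 'pro',  20.0),
        (1, 'pro',  20.0),   -- exact duplicate row
        (2, NULL,   5.0),    -- missing plan
        (3, 'free', NULL);   -- missing amount
""")

query = """
    SELECT DISTINCT                               -- dedupe identical rows
        user_id,
        COALESCE(plan, 'unknown') AS plan,        -- replace NULL plan with a label
        COALESCE(amount, 0.0)     AS amount,      -- treat NULL amount as zero
        CASE WHEN COALESCE(amount, 0.0) >= 10     -- conditional segmentation
             THEN 'high' ELSE 'low' END AS segment
    FROM events
    ORDER BY user_id
"""
rows = conn.execute(query).fetchall()
for row in rows:
    print(row)
```

Whether zero is a sensible default for a missing amount depends on the dataset; in a real analysis that choice would need to be justified rather than assumed.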

Distributed Computing with Spark SQL

This course is all about big data. It’s for students with SQL experience who want to take the next step on their data journey by learning distributed computing using Apache Spark. Students will gain a thorough understanding of this open-source standard for working with large datasets, along with the fundamentals of data analysis using SQL on Spark, setting the foundation for how to combine data with advanced analytics at scale and in production environments. The four modules build on one another, and by the end of the course you will understand the Spark architecture, queries within Spark, common ways to optimize Spark SQL, and how to build reliable data pipelines.

The first module introduces Spark and the Databricks environment including how Spark distributes computation and Spark SQL. Module 2 covers the core concepts of Spark such as storage vs. compute, caching, partitions, and troubleshooting performance issues via the Spark UI. It also covers new features in Apache Spark 3.x such as Adaptive Query Execution. The third module focuses on Engineering Data Pipelines including connecting to databases, schemas and data types, file formats, and writing reliable data. The final module covers data lakes, data warehouses, and lakehouses. Students build production grade data pipelines by combining Spark with the open-source project Delta Lake. By the end of this course, students will hone their SQL and distributed computing skills to become more adept at advanced analysis and to set the stage for transitioning to more advanced analytics as Data Scientists.

SQL for Data Science Capstone Project

Data science is a dynamic and growing career field that demands SQL knowledge and skills to be successful. This course is designed to provide you with a solid foundation in applying SQL skills to analyze data and solve real business problems.

Whether you have successfully completed the other courses in the Learn SQL Basics for Data Science Specialization or are taking just this course, this project is your chance to apply the knowledge and skills you have acquired to practice important SQL querying and solve problems with data. You will participate in your own personal or professional journey to create a portfolio-worthy piece from start to finish. You will choose a dataset and develop a project proposal. You will explore your data and perform some initial statistics you have learned through this specialization. You will uncover analytics for qualitative data and consider new metrics that make sense from the patterns that surface in your analysis. You will put all of your work together in the form of a presentation where you will tell the story of your findings. Along the way, you will receive feedback through the peer-review process. This community of fellow learners will provide additional input to help you refine your approach to data analysis with SQL and present your findings to clients and management.

Instructors

Sadie St. Lawrence

Katrina Glaeser Poole

Brooke Wenig

Conor Murphy

University of California, Davis

UC Davis, one of the nation’s top-ranked research universities, is a global leader in agriculture, veterinary medicine, sustainability, environmental and biological sciences, and technology. With four colleges and six professional schools, UC Davis and its students and alumni are known for their academic excellence, meaningful public service and profound international impact.

Frequently Asked Questions

What is the refund policy?

If you subscribed, you get a 7-day free trial during which you can cancel at no penalty. After that, we don’t give refunds, but you can cancel your subscription at any time. See our full refund policy.

Can I just enroll in a single course?

Yes! To get started, click the course card that interests you and enroll. You can enroll and complete the course to earn a shareable certificate, or you can audit it to view the course materials for free. When you subscribe to a course that is part of a Specialization, you’re automatically subscribed to the full Specialization. Visit your learner dashboard to track your progress.

Is financial aid available?

Yes. In select learning programs, you can apply for financial aid or a scholarship if you can’t afford the enrollment fee. If financial aid or a scholarship is available for your learning program selection, you’ll find a link to apply on the description page.

Can I take the course for free?

When you enroll in the course, you get access to all of the courses in the Specialization, and you earn a certificate when you complete the work. If you only want to read and view the course content, you can audit the course for free. If you cannot afford the fee, you can apply for financial aid.

Is this course really 100% online? Do I need to attend any classes in person?

This course is completely online, so there’s no need to show up to a classroom in person. You can access your lectures, readings and assignments anytime and anywhere via the web or your mobile device.

Will I earn university credit for completing the Specialization?

This Specialization doesn't carry university credit, but some universities may choose to accept Specialization Certificates for credit. Check with your institution to learn more.

How long does it take to complete the Specialization?

This Specialization consists of 4 courses that could take anywhere from 4 to 6 months to complete.

What background knowledge is necessary?

This Specialization is intended for learners with no prior knowledge, who will advance their SQL skills as they progress through the courses.

Do I need to take the courses in a specific order?

We absolutely recommend you take the first course listed first and the Capstone project last, but courses two and three can be completed in either order.

More questions? Visit the Learner Help Center.

SQL for Data Science Capstone Project

Description

Data science is a dynamic and growing career field that demands SQL knowledge and skills to be successful. This course is designed to provide you with a solid foundation in applying SQL skills to analyze data and solve real business problems.

This resource is offered by an affiliate partner. If you pay for training, we may earn a commission to support this site.

Career Relevance by Data Role

The techniques and tools covered in SQL for Data Science Capstone Project are most similar to the requirements found in Business Analyst job advertisements.

Tools and Techniques

Similar Opportunities

Data Wrangling, Analysis and AB Testing with SQL

Coursera - University of California, Davis

Analyze Data to Answer Questions

Intermediate SQL Server, Data Analysis in Social Science—Assessing Your Knowledge

edX - Massachusetts Institute of Technology

How to Transform Tables with SQL

Learn How to Analyze Business Metrics with SQL, SQL for Data Analysis, SQL Analysis for Data Developers, SQL for Data Science

COMMENTS

  1. SQL for Data Science Capstone Project

    29,594 recent views. Data science is a dynamic and growing career field that demands SQL knowledge and skills to be successful. This course is designed to provide you with a solid foundation in applying SQL skills to analyze data and solve real business problems. Whether you have successfully completed the other courses in the Learn ...

  2. Using SQL for Data Science, Part 1

    Video created by University of California, Davis for the course "SQL for Data Science Capstone Project". In this milestone, you will present your findings. You will identify your audience and create a presentation tailored to them. You will be ...

  3. GitHub

    Capstone project: Twitter activity of the US members of Congress. 🔷 This repo contains the requested work for my final project of SQL for Data Science Capstone Project by UCDavis/Coursera. 🔷 The full analysis and Python code can be viewed in the dedicated Jupyter notebook. However, it's usually better to see it with the Nbviewer service.

  4. Course Introduction and Welcome

    Video created by University of California, Davis for the course "SQL for Data Science Capstone Project". In this first milestone, you will select your client and import your dataset. You will begin to explore your data to understand it and make ...

  5. GitHub

    Capstone Project for Databases and SQL for Data Science Specialization from Coursera License

  6. Using SQL for Data Science, Part 2

    Whether you have successfully completed the other courses in the Learn SQL Basics for Data Science Specialization or are taking just this course, this project is your chance to apply the knowledge and skills you have acquired to practice important SQL querying and solve problems with data.

  7. SQL for Data Science Capstone Project

    Data science is a dynamic and growing career field that demands SQL knowledge and skills to be successful. This course is designed to provide you with a solid foundation in applying SQL skills to analyze data and solve real business problems. Whether you have successfully completed ...

  8. SQL in Notebooks

    Video created by University of California, Davis for the course "SQL for Data Science Capstone Project". In this first milestone, you will select your client and import your dataset. You will begin to explore your data to understand it and make ...

  9. Learn SQL Basics for Data Science

    A Coursera Specialization is a series of courses that helps you master a skill. To begin, enroll in the Specialization directly, or review its courses and choose the one you'd like to start with. ... SQL for Data Science Capstone Project. 4.2. stars. 185 ratings. Data science is a dynamic and growing career field that demands knowledge and ...

  10. SQL for Data Science Capstone Project (Coursera)

    Data science is a dynamic and growing career field that demands SQL knowledge and skills to be successful. This course is designed to provide you with a solid foundation in applying SQL skills to analyze data and solve real business problems. Whether you have successfully completed the other courses in the Learn SQL Basics for Data Science Specialization or are taking just this course ...

  11. SQL for Data Science Capstone Project

    Description. Don Noxon. Data science is a dynamic and growing career field that demands SQL knowledge and skills to be successful. This course is designed to provide you with a solid foundation in applying SQL skills to analyze data and solve real business problems. This resource is offered by an affiliate partner.