data science capstone final project


Data Science Capstone Final Project

This presentation briefly describes a project that predicts the next word of a sentence fragment or phrase.

The application is a capstone project for the Coursera Data Science Specialization offered by Johns Hopkins University with support from SwiftKey.

The main goal is to develop a predictive algorithm. The front end is a Shiny application and the back end is written in R.

The application was developed using a sample of English-language Twitter tweets. This sample was provided by SwiftKey.

The dataset comes in German, English, Finnish, and Russian versions. This application uses only the English version.

After loading all of the English data, the algorithm counted the number of lines, removed profanity, and tokenized the text. The tokens were then organized into n-gram sequences.
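The cleaning and tokenization steps can be sketched roughly as follows. The original project is written in R; this is only an illustrative Python sketch, and the tiny `PROFANITY` set, the regular expression, and the function names are assumptions rather than the author's actual code.

```python
import re

# Hypothetical banned-word list; a real project would load a much longer one.
PROFANITY = {"darn", "heck"}

def tokenize(line):
    """Lowercase a line and split it into word tokens (letters and apostrophes)."""
    return re.findall(r"[a-z']+", line.lower())

def clean_tokens(tokens, banned=PROFANITY):
    """Drop every token that appears in the banned-word set."""
    return [t for t in tokens if t not in banned]

def ngrams(tokens, n):
    """Return every run of n consecutive tokens as a tuple."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]

lines = ["I love data science", "Data science is darn fun"]
print("lines loaded:", len(lines))

tokens = clean_tokens(tokenize(lines[1]))
print(ngrams(tokens, 2))
```

Note that dropping a banned token before building n-grams joins its neighbours into one sequence. An alternative design would be to discard the whole line, or to split the line at the removed word.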

The result is a set of bigram, trigram, and quadrigram models, each converted into a frequency dictionary sorted by frequency.
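A frequency dictionary of this kind could be built along the following lines. This is a minimal Python sketch for illustration only (the project itself uses R); the function name `ngram_frequencies` and the toy sentences are assumptions.

```python
from collections import Counter

def ngram_frequencies(sentences, n):
    """Count every n-gram across tokenized sentences, most frequent first."""
    counts = Counter()
    for tokens in sentences:
        for i in range(len(tokens) - n + 1):
            counts[tuple(tokens[i:i + n])] += 1
    # most_common() returns (ngram, count) pairs sorted by descending count.
    return counts.most_common()

# Toy input: each sentence is already tokenized and cleaned.
sentences = [["i", "love", "data"], ["i", "love", "r"]]
for ngram, freq in ngram_frequencies(sentences, 2):
    print(" ".join(ngram), freq)
```

The same function covers the bigram, trigram, and quadrigram cases by passing `n` as 2, 3, or 4.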

Application

The application was designed for functionality and simplicity. When it loads, it checks for input and, finding none, shows a message asking the user to enter a word or phrase.

The user can then enter a word or phrase and must click Submit. When this happens, three items are displayed.

The application starts with the quadrigram model and works its way down to the lower-order models until it finds a predicted word.
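The back-off search described above might look like the sketch below. It is written in Python for illustration (the project uses R), and the names `build_model` and `predict`, the `top_k` parameter, and the toy training sentences are all assumptions, not the author's implementation.

```python
from collections import Counter, defaultdict

def build_model(sentences, max_n=4):
    """Map each context of 1 to max_n-1 words to a Counter of following words."""
    model = defaultdict(Counter)
    for tokens in sentences:
        for n in range(2, max_n + 1):
            for i in range(len(tokens) - n + 1):
                context = tuple(tokens[i:i + n - 1])
                model[context][tokens[i + n - 1]] += 1
    return model

def predict(model, phrase, max_n=4, top_k=3):
    """Try the longest context first, then back off to shorter ones."""
    tokens = phrase.lower().split()
    for n in range(max_n, 1, -1):
        context = tuple(tokens[-(n - 1):])
        # Skip this order if the phrase is too short to supply n-1 words.
        if len(context) == n - 1 and context in model:
            return [word for word, _ in model[context].most_common(top_k)]
    return []  # no n-gram of any order matched

sentences = [
    ["thanks", "for", "the", "follow"],
    ["thanks", "for", "the", "support"],
    ["thanks", "for", "the", "follow"],
    ["see", "the", "movie"],
]
model = build_model(sentences)
print(predict(model, "thanks for the"))  # matched by the quadrigram model
print(predict(model, "watch the"))       # backs off to the bigram model
```

This is plain back-off with no smoothing or score discounting: the first order that has any match wins, and its most frequent continuations are returned.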

The prediction app is hosted on shinyapps.io: https://zagnut.shinyapps.io/shiny/

The code for this frontend application is hosted here: https://github.com/motticus/capstone

The data for this application is hosted here: https://d396qusza40orc.cloudfront.net/dsscapstone/dataset/Coursera-SwiftKey.zip

Data Science Capstone Projects #18

Ekaterina Butyugina

data-science-city-and-data

Cortexia: Sustainable Clean City - Darkzones Analytics

Students: Dominik Bacher, Valeriia Rutskaia

Results after the predictions

Talmis: Macroeconomic forecasting using machine learning methods

Students: Hussam Al-Homsi, Patrizia Will

Target Country GDP vs User Imputed GDP UK

CancerDataNet: Time predictions for follow-up treatment in cancer patients

Students: Muchun Zhong, Jacques Stimolo, Ernest Mihelj

The Prognostic Models

360° Stock Prediction: Predicting the highest return stocks globally via robust KPIs and perceived company confidence

Students: Karim Khalil, Fernando Beato, Lukas Doboczky, Rafael Zack

JHU-Data-Science-Capstone

Coursera Data Science Specialization

JHU Data Science Capstone Project

The completed project.

A Shiny App for predicting the next word in a string. The App

The Project

Project Overview Syllabus

Project Tasks - Instructions

  • Task 0: Understanding the Problem
  • Task 1: Getting and Cleaning the Data
  • Task 2: Exploratory Data Analysis
  • Task 3: Modeling
  • Task 3A: Milestone Report
  • Task 4: Prediction Model
  • Task 5: Creative Exploration
  • Task 6: Data Product
  • Task 7: Slide Deck
  • Task 8: Final Project

Project Scripts - Solutions

  • Task 0: Exploring the tm Package
  • Task 1: Getting and Cleaning the Data
  • Task 2: Exploratory Data Analysis
  • Task 3A: Milestone Report
  • Task 4: Working toward a Prediction Model
  • Task 04A: Fast Ngram Files
  • Task 05: Prediction Model
  • Task 06: Shiny App
  • Task 06A: Shiny App Source Code
  • Task 07: Slide Presentation

Course Quizzes

Quiz 1 Quiz 2 Quiz 3

Tidy Data Text Mining with R: A Tidy Approach

Capstone Projects

Education is one of the pillars of the Data Science Institute.

Through educational activities, we strive to create a community in Data Science at Columbia. The capstone project is one of the most lauded elements of our MS in Data Science program. As a final step during their study at Columbia, our MS students work on a project sponsored by a DSI industry affiliate or a faculty member over the course of a semester.

Faculty-Sponsored Capstone Projects

A DSI faculty member proposes a research project and advises a team of students working on this project. This is a great way to run a research project with enthusiastic students, eager to try out their newly acquired data science skills in a research setting. This is especially a good opportunity for developing and accelerating interdisciplinary collaboration.

2022-2023 Academic Year

  • FALL 2022: July 15, 2022
  • SPRING 2023: TBA

Project Archive

Professional and Lifelong Learning

In-person, blended, and online courses

Data Science: Capstone

Associated Schools

Harvard T.H. Chan School of Public Health

What you'll learn.

Course description

To become an expert data scientist you need practice and experience. By completing this capstone project you will get an opportunity to apply the knowledge and skills in R data analysis that you have gained throughout the series. This final project will test your skills in data visualization, probability, inference and modeling, data wrangling, data organization, regression, and machine learning.

Unlike the rest of our Professional Certificate Program in Data Science, in this course, you will receive much less guidance from the instructors. When you complete the project you will have a data product to show off to potential employers or educational programs, a strong indicator of your expertise in the field of data science.

Rafael Irizarry

You may also like.

Using Data to Design Your Workplace: Offices, Technology, and People

Causal Diagrams: Draw Your Assumptions Before Your Conclusions

Introduction to Bioconductor

IMAGES

  1. Why Capstone Project Is A Key Feature While Selecting A Data Science Course

  2. Data Science Capstone Project Showcase

  3. Data Science Summer 2020 Capstone Project Showcase

  4. GitHub

  5. Data Science at Scale

  6. DAT102 Data Science Capstone

VIDEO

  1. Capstone: The Film Industry

  2. Creation, Creativity & Ethics in the Age of AI (DTSC-690)- By Negash Fufa

  3. Capstone Project evaluation by Data Science Experts

  4. Capstone Final Project

  5. Capstone Project Data Karyawan Dyah Ayu Daratika

  6. Data Science Masterclass- Episode 3

COMMENTS

  1. Roderic19/IBM-Applied-Data-Science-Capstone

    Final Project for IBM Data Science Certificate. Contribute to Roderic19/IBM-Applied-Data-Science-Capstone development by creating an account on GitHub.

  2. Coursera-Data-Science-Capstone/Final-Project-Submission.Rmd

    title: "Coursera Data Science Capstone - Final Project Submission". author: "®γσ, Eng Lian Hu". date: "4/28/2016". output: revealjs::revealjs_presentation:.

  3. Data Science Capstone Final Project

    Data Science Capstone Final Project. This project uses the author's own knowledge of data science and basic knowledge of NLP in R to build an app that can predict

  4. Data Science Capstone Final Project

    Data Science Capstone Final Project. David Mott. This presentation is a short description of a project that will predict the next word of a sentence

  5. Data Science Capstone Final Project

    This is the final project of the Coursera Data Science Capstone. In this project, a Word Predictor application was created using words or

  6. Coursera Data Science Capstone Final Project

    Introduction. This presentation was created to present the final assignment for the Coursera Data Science Capstone course.

  7. Data Science Capstone Projects #18

    A full description of the Data Science Student's final projects.

  8. JHU Data Science Capstone Project

    Coursera Data Science Specialization. ... JHU Data Science Capstone Project. The Completed Project ... Task 7: Slide Deck · Task 8: Final Project

  9. Capstone Projects

    As a final step during their study at Columbia, our MS students work on a project sponsored by a DSI industry affiliate or a faculty member over the course of a

  10. Data Science: Capstone

    This final project will test your skills in data visualization, probability, inference and modeling, data wrangling, data organization, regression, and machine