Capstone Projects in Data Science

The University of Virginia
Charlottesville, Virginia, United States
Associate Professor
Timeline
  • October 14, 2024
    Program start
  • April 29, 2025
    Deliverable
  • May 20, 2025
    Final Presentation
  • May 24, 2025
    Program end
Program
2/2 project matches
Dates set by program
Agreements required
Preferred organizations
Anywhere
Any organization type
Any industries

Program scope

Categories
Data visualization, Data analysis, Data modelling, Project management, Data science
Skills
data cleansing, feature engineering, data ethics, exploratory data analysis, data science, data engineering, python (programming language), project management, machine learning
Participant goals and capabilities

The University of Virginia School of Data Science seeks proposals from academic, commercial, healthcare, nonprofit, and government organizations interested in sponsoring student capstone projects. Capstones are consulting projects that allow students to gain real-world experience in analysis, machine learning, and data engineering. Sponsors are vital partners in these projects, providing a clear objective, data sets, guidance, and orientation to the teams. Capstone projects allow sponsors to explore critical business questions while contributing to the development of the data science workforce.


Capstone projects challenge UVA Data Science students to explore, integrate and analyze data to solve real-world problems. Each team of 3-5 students is advised by a faculty member who has experience in the techniques and data types specific to the project.


All members of the project teams will have completed courses in computational methods for data science, machine learning, and exploratory data analysis, and will have programming skills in R and Python. During the program, students also learn feature engineering, data cleaning and wrangling, deep learning, data ethics, project management, team building, technical writing, and oral communication.



Participants

Graduate
Beginner, Intermediate, Advanced levels
68 participants
Project
100 hours per participant
Educators assign participants to projects
Teams of 4
Expected outcomes and deliverables

Sponsoring organizations gain valuable insights about their operations in tangible reports and other deliverables. Each project has multiple deliverables:

  • A project plan detailing the scope of the problem and the plan of work.
  • Progress reports.
  • A final paper and a technical presentation at the end of the project.


These deliverables ensure the quality of the work and provide a vehicle for the students to communicate their results to the sponsors and to contribute to the knowledge and understanding of the data science community.

Additional organization criteria

Organizations must answer the following questions to submit a match request to this program:

Please provide a brief description of the data set students will be working with.

Are the students required to use any particular platform, environment, language, etc. to work with the data?

Are the tools and technologies required for the capstone project available as free or open-source software?

Would you be willing to meet with students to provide guidance on a fairly regular basis (e.g., twice a month)?

If you are a corporation, would you be willing to provide a voluntary gift to the university to help fund the capstone program?