Junior Data Scientist (2 Months Summer Internship)

Added: March 18, 2024
Precisely is the leader in data integrity. We empower businesses to make more confident decisions based on trusted data through a unique combination of software, data enrichment products and strategic services. What does this mean to you? For starters, it means joining a company focused on delivering outstanding innovation and support that helps customers increase revenue, lower costs and reduce risk. In fact, Precisely powers better decisions for more than 12,000 global organizations, including 99 of the Fortune 100. Precisely's 2500 employees are unified by four company core values that are central to who we are and how we operate: Openness, Determination, Individuality, and Collaboration. We are committed to career development for our employees and offer opportunities for growth, learning and building community. With a "work from anywhere" culture, we celebrate diversity in a distributed environment with a presence in 30 countries as well as 20 offices in over 5 continents. Learn more about why it's an exciting time to join Precisely!

Intro And Job Overview

Precisely’s Summer Internship Program is a paid, part-time opportunity for Masters students actively enrolled in university. This is an 8-week program that begins June 10th and concludes August 2nd. This opportunity is 100% remote located in the EST timezone. The Junior Data Scientist Intern will work closely with the R&D team.

The intern will work on a project that will focus on identifying the component parts of a postal addresses so that it can be used in data analytics and AI projects. Precisely already has technologies that parse addresses and extract component elements, such as house number, unit number, street, city, state, postal codes, country etc, but the success rate is limited. While existing tech works well for well-structured addresses, there is room for improvement when addresses are not well-structured, or have lots of errors. This project will explore improving the address parser success rate by employing NER (Named entity recognition) techniques.

Responsibilities And Duties

  • The deliverables will consist of a report detailing the methodology, exploratory data analysis (EDA), hypotheses and results for the different approaches and experiments, along with learnings and recommendations for future work.
  • Daily standup meetings with the team to update on the progress of the project
  • Internal documentation with details about pass/fail approaches and iterations

Requirements And Qualifications

  • Experience with python, particularly pandas and scikit-learn.
  • Experience in natural language processing, namely named entity resolution and tagging.
  • Fundamentals of statistics, statistical learning, design of experiments and linear algebra skills.
  • Ability to visualize trends in data and be able to express their ideas, hypotheses and results in visualized data.

Program Benefits

  • Valuable experience related to the degree you are pursuing.
  • Hands on experience at an established tech company.
  • Networking opportunities with company leaders around the globe.

It is a requirement for all roles at Precisely to adhere to applicable data privacy and security laws, rules, regulations, and company policies. For more information about Precisely’s privacy practices, please see our Privacy Notice: