Sr Principal Data Scientist
RELX

San Diego, California

Posted in IT


This job has expired.

Job Info


Senior Principal Data Scientist for the Author Profiling Team

This role can sit anywhere in the US

The Author Profiler is the world's leading researcher disambiguation system, powering some of the world's foremost scholarly databases such as Scopus, ACM Digital Library, and IEEE Explore. Author Profiles are automatically constructed from a diverse array of sources that include tens of millions of research artifacts and cover millions of publishing researchers. We are looking for a data scientist with at least 5 years of experience in Machine Learning, to take a leading role in our team.



Responsibilities

  • Advance the development of our models for author matching and clustering.
  • Perform root cause analysis and design solutions for known issues.
  • Enhance the author profiler system to support new requirements.
  • Work on varied projects to extend the author profiler to new content types and new product areas.
  • Develop a deep understanding of our systems and algorithms and the scholarly data we process. This is necessary to enable identification of opportunities for fundamental improvements in our algorithms, and to enable identification of root causes of problems
  • Work closely with other data scientists in our team, with a track to team leadership.
  • ·Collaborate with the content curation team to create training and evaluation datasets to be used to improve the author profiler. This may include devising new sampling techniques and evaluation protocols for targeted improvements.
  • Work closely with our software development team to develop new tools, conduct experiments and develop new features, and help implement and optimize future versions of the author profiler.
  • Work independently with end-to-end ownership of tasks.
  • Work cooperatively with other team members in the U.S., Europe, and India.


Requirements
  • Ph.D. (or very strong Masters) in Computer Science, Data Science, Machine Learning, Applied Statistics, or related disciplines
  • Experience working on commercial data science projects that have been deployed in production.
  • Deep knowledge of Machine Learning, as well as coding proficiency using common libraries and frameworks.
  • Experience with clustering large (10s of millions) data sets, ideally involving person data.
  • Languages: Strong experience with Python and Java
  • Big Data/Cloud Environments: experience with Spark/pySpark/DataBricks and AWS
  • Ability to work independently and as part of a team.
  • Strong communication skills and the ability to analyze complex problems and devise effective solutions
  • Beneficial to have: SQL, XML

-----------------------------------------------------------------------

Elsevier is an equal opportunity employer: qualified applicants are considered for and treated during employment without regard to race, color, creed, religion, sex, national origin, citizenship status, disability status, protected veteran status, age, marital status, sexual orientation, gender identity, genetic information, or any other characteristic protected by law. If a qualified individual with a disability or disabled veteran needs a reasonable accommodation to use or access our online system, that individual should please contact accommodations@relx.com or if you are based in the US you may also contact us on 1.855.833.5120.

Please read our Candidate Privacy Policy


This job has expired.

More IT jobs


CoreLogic Solutions, LLC
Oklahoma City, Oklahoma
Posted about 1 hour ago

CoreLogic Solutions, LLC
Dallas, Texas
Posted about 1 hour ago

Civica
Petersburg, Virginia
Posted 5 minutes ago

Get Hired Faster

Subscribe to job alerts and upload your resume!

*By registering with our site, you agree to our
Terms and Privacy Policy.