Research and detect valuable data sources and automate collection processes
Perform preprocessing of structured and unstructured data
Design, implement and deliver maintainable and high-quality code using best practices (e.g. Git/Github, Secrets, Configurations, Yaml/JSON)
Review large amounts of information to discover trends and patterns
Create predictive models and machine-learning algorithms
Modify and combine different models through ensemble modeling
Organize and present information using data visualization techniques
Develop and suggest solutions and strategies to business challenges
Work together with engineering and product development teams

3+ years' experience of working on Data Scientist or Data Analyst position
Significant experience in data mining, machine-learning and operations research
Experience with data modeling, design patterns, building highly scalable and secured solutions preferred
Prior experience installing data architectures on Cloud providers (e.g. AWS,GCP,Azure), using DevOps tools and automating data pipelines
Good experience using business intelligence/visualization tools (such as Tableau), data frameworks (such as Hadoop, DataFrames, RDDs, Dataclasses) and data formats (CSV, JSON, Parquet, Avro, ORC)
Advanced knowledge of R, SQL and Python; familiarity with Scala, Java or C++ is an asset
MA or PhD degree in Computer Science, Engineering or other relevant area; graduate degree in Data Science or other quantitative field is preferred
Must be a U.S. Citizen.

Data Scientist

Key skills