DockerHadoopHBaseJavaKubernetesMapReduceMySQLNoSQLPythonRedisSQLRMATLABMachine LearningMLAnalyticsStatistical AnalysisPytestIntegration TestingPostmanRESTfulGitVersion Control
About this role
Role Overview
Develop custom data models to drive innovative business solutions.
Build complex data sets from multiple data sources, both internally and externally.
Conduct advanced statistical analysis to determine trends and significant data relationships.
Build learning systems to analyze and filter continuous data flows and offline data analysis.
Train algorithms to apply models to new data sets.
Validate models and algorithmic techniques.
Scale new algorithms to large data sets.
Combine data features to determine search models.
Research new techniques and best practices within the industry.
Utilize system tools including MySQL, Hadoop, Weka, R, Matlab, and ILog.
Analyze large data sets to develop custom models and algorithms to drive business solutions.
Work on project teams to provide analytical support to projects such as email targeting, business optimization, and consumer recommendations for Walmart eCommerce.
Research new trends in the industry and utilize up-to-date technology (for example, HBase, MapReduce, LAPack, Gurobi) and analytical skills to support assigned projects.
Work with cross-functional partners across the business.
Develop models of current state to determine needed improvements.
Requirements
Master’s degree in Statistics, Economics, Analytics, Mathematics, Computer Science, Information Technology, Engineering (any), or related field and 1 year of experience in an analytics related field; OR Bachelor’s degree in Statistics, Economics, Analytics, Mathematics, Computer Science, Information Technology, Engineering (any), or related field and 3 years of experience in an analytics related field.
Experience with object-oriented programming languages: Python and Java.
Experience developing and integrating RESTful APIs for application workflows.
Experience with unit and integration testing using Python frameworks: Pytest and Unittest.
Experience with containerization and infrastructure tools: Docker and Kubernetes.
Experience querying using SQL and relational databases.
Experience with NoSQL database: Redis.
Experience with API testing and response validation using Postman.
Experience designing and maintaining data-driven reporting tools to analyze system and application performance and user trends.
Experience performing anomaly detection using statistical methods: probability distribution, and Machine Learning (ML) models.
Experience building data pipelines and processing log data to generate application insights.
Experience with version control using Git and participating in code reviews.
Tech Stack
Docker
Hadoop
HBase
Java
Kubernetes
MapReduce
MySQL
NoSQL
Python
Redis
SQL
Benefits
Health benefits include medical, vision and dental coverage.
Financial benefits include 401(k), stock purchase and company-paid life insurance.
Paid time off benefits include PTO (including sick leave), parental leave, family care leave, bereavement, jury duty and voting.
Other benefits include short-term and long-term disability, education assistance with 100% company paid college degrees, company discounts, military service pay, adoption expense reimbursement, and more.
Competitive pay as well as performance-based incentive awards and other great benefits.