BeyondTrust is a global leader in identity security, dedicated to creating a safer world through their cybersecurity SaaS portfolio. The Data Engineer will enhance the company's data lake, optimize infrastructure, and develop new data assets to drive business value while solving complex identity security problems.
Responsibilities:
- Take ownership of data modelling to solve complex identity security problems
- Solve complex analytical problems across disparate systems to provide a unified view of security posture in the datalake
- Optimize data workloads at a software level by improving processing efficiency
- Identify opportunities for engineering process improvement and collaborate with senior resources to execute a plan of action
- Use monitoring and observability best practices to ensure optimal pipeline performance
- Assist with ML Operations to ensure optimal model efficiency
- Leverage CI/CD best practices to effectively develop and release source code
Requirements:
- Strong programming and technology knowledge in cloud data processing
- Data warehousing for analytics use cases
- Highly proficient in Python (or another object orientated language)
- Distributed processing framework experience
- Spark
- Realtime processing experience
- Existing Databricks knowledge is a plus
- Interest and aptitude for cybersecurity; interest in identity security is highly preferred
- Technical understanding of underlying systems and computation minutiae
- Experience working with distributed systems and data processing on object stores
- Ability to work autonomously
- Some experience with graph data stores