BeyondTrust is a global leader in identity security, focused on creating a safer world through its cybersecurity SaaS portfolio. As a Data Engineer, you will enhance its data lake, optimize existing infrastructure, and develop new data assets that drive business value, using advanced data engineering tools.
Responsibilities:
- Take ownership of data modelling to solve complex identity security problems
- Solve complex analytical problems across disparate systems to provide a unified view of security posture in the data lake
- Optimize data workloads at a software level by improving processing efficiency
- Identify opportunities for engineering process improvement and collaborate with senior team members to execute a plan of action
- Use monitoring and observability best practices to ensure optimal pipeline performance
- Assist with ML Operations to ensure optimal model efficiency
- Leverage CI/CD best practices to effectively develop and release source code
Requirements:
- Strong programming and technology knowledge in cloud data processing
- Spark experience is required; existing Databricks knowledge is a plus
- Technical understanding of underlying systems and computation minutiae
- Experience working with distributed systems and data processing on object stores
- Ability to work autonomously
- Experience with data warehousing for analytics use cases
- Highly proficient in Python (or another object-oriented language)
- Distributed processing framework experience
- Interest in and aptitude for cybersecurity; interest in identity security is highly preferred
- Highly Preferred: Spark
- Highly Preferred: Real-time processing experience
- Ideal: Some experience with graph data stores