Design, implement, and optimize LLM-powered systems (e.g., RAG, chat agents, summarizers, knowledge graph integration).
Build and manage data indexing and retrieval pipelines using LlamaIndex, LangChain, or similar frameworks.
Implement and maintain vector databases (e.g., Pinecone, Neo4j, Weaviate, Chroma, or Azure Cognitive Search).
Integrate open-source and proprietary LLMs (e.g., GPT, Claude, Llama) into the CoreStory Platform.
Develop and refine AI-driven features — including generative insights, automated summarization, and narrative analytics.
Collaborate with DevOps and backend teams to deploy scalable AI services within CoreStory’s cloud infrastructure.
Continuously benchmark model performance, latency, and cost, identifying opportunities for optimization.
Stay current with advancements in AI — from model architectures to emerging frameworks — and propose innovative applications aligned with CoreStory’s mission.
Contribute to internal documentation, experimentation frameworks, and evaluation methodologies.
Requirements
7+ years of overall engineering experience with at least 3+ years of experience in AI engineering, machine learning, or applied NLP.
Strong hands-on experience with LlamaIndex, LangChain, or similar orchestration frameworks.