you will be working on Client horizontal data consolidation platform One Data. The platform itself organizes data from both structured and un structed data from the enterprise. It has Gen AI functionalities primarily leveraging data bricks and sits in an AWS environment. What you would be responsible for is the personalization aspect of this platform, there are some requirements coming in that need more focused attention on specific sets of data, so new pipelines and models will have to be created.
Required skills:
Advanced degree in Computer Science, Engineering, or other STEM field with minimum 2 years of industrial experience
Solid understanding of active and continuous learning methodology and machine learning algorithms and techniques, such as regression, classification, clustering, and deep learning
Proficiency in data manipulation and analysis using SQL and other tools (Databricks, AWS).
Experience with Python/Java, including deep learning frameworks and tools such as PyTorch, TensorFlow, Hugging Face, Spacy.
Experience with machine learning, deep learning, reinforcement learning
Experience with AI concepts related to RAG architecture, LLMs and Vector Datastores
Experience with cloud environments(AWS, Azure, GCP) and big data processing frameworks such as Spark.
Ability to learn by doing, to adapt to ever-changing requirements and have a bias for action and outcome
Strong written and verbal communication skills