Job Description: GenAI Data Scientist - Associate
PwC US - Acceleration Center is seeking an emerging GenAI Data Scientist to join our team at the Associate level. This role provides an exciting opportunity to contribute to developing and implementing machine learning models and algorithms for our GenAI projects. The ideal candidate should have foundational knowledge in data science, with an interest in GenAI technologies, and possess an understanding of statistical analysis, machine learning, data visualization, and basic application programming.
Responsibilities:
Assist cross-functional teams in gathering business requirements and identifying opportunities for applying GenAI technologies.
Support the development and implementation of machine learning models and algorithms under the guidance of senior data scientists.
Participate in data cleaning, preprocessing, and feature engineering to prepare data for analysis.
Work closely with data engineers to facilitate efficient data processing and integration into machine learning pipelines.
Help validate and evaluate model performance using standard metrics and techniques.
Contribute to the development and deployment of machine learning applications and solutions.
Utilize basic object-oriented programming skills to assist in building software components.
Gain experience with Kubernetes for container orchestration and deployment.
Assist in designing and building chatbots using GenAI technologies.
Help communicate findings and insights to stakeholders through basic data visualizations and reports.
Keep abreast of the latest advancements in GenAI technologies and contribute to discussions on innovative solutions to enhance data science processes.
Requirements:
Bachelor's degree in Data Science, Computer Science, Statistics, or a related field.
1-3 years of relevant experience, ideally with exposure to GenAI projects.
Proficient programming skills in Python, R, or Scala.
Familiarity with machine learning libraries and frameworks such as TensorFlow, PyTorch, or scikit-learn.
Basic experience with data preprocessing, feature engineering, and data wrangling techniques.
Understanding of statistical analysis and experimental design.
Awareness of cloud computing platforms such as AWS, Azure, or Google Cloud.
Some knowledge of data visualization tools and techniques.
Strong problem-solving and analytical skills.
Good communication and teamwork abilities.
Ability to thrive in a fast-paced and dynamic environment.
Preferred Qualifications:
Some experience with object-oriented programming languages such as Java, C++, or C#.
Exposure to developing or assisting in the deployment of machine learning applications.
Basic understanding of data privacy and compliance issues.
Nice to Have Skills:
Exposure to Azure AI Search, Azure Doc Intelligence, Azure OpenAI, AWS Textract, AWS Open Search, AWS Bedrock.
Familiarity with LLM backed agent frameworks like Autogen, Langchain, Semantic Kernel.
Interest in chatbot design and development.