Location: of this role is hybrid with flexible work locations. We are a global team, located in London, San Francisco, Toronto, and remotely. Autodesk is a hybrid-first company, allowing workers to work remotely, in an office, or a mix of both.
Responsibilities
Collaborate on engineering projects for product with a diverse, global team of researchers and engineers.
Develop scalable distributed systems to process, filter, and deploy datasets for use with machine learning.
Process large, unstructured, multi-modal (text, images, 3D models, code snippets, metadata) data sources into formats suitable for machine learning.
Conduct and analyze experiments on data to provide insights.
Produce data visualizations and summaries to communicate data characteristics to researchers and leadership.
Work with our legal and trust teams to ensure compliant and ethical use of data.
Develop data pipelines into secure remote environments respecting and demonstrating security best practices.
Write testable code that is well documented and easy to understand.
Analyze errors and provide solutions to problems that arise.
Minimum Qualifications
BSc or MSc in Computer Science, or equivalent industry experience.
3+ years of experience with software version control, unit tests, and deployment pipelines.
You have experience in data modelling, architecture, and processing skills with varied data representations including 2D and 3D geometry.
You have experience with:
Cloud services & architectures (AWS, Azure, etc.).
Documenting code, architectures, and experiments.
Relational databases (e.g., MySQL, PostgreSQL) and NoSQL databases (e.g., MongoDB, Cassandra).
Frameworks such as Ray data, Metaflow, Hadoop, Spark, or Hive.
Vector data stores.
Proficiency with Linux systems and bash terminals.
Knowledge of cloud architectures and networking
Additional Qualifications
Experience with computational geometry such as mesh or boundary representation data processing.
Experience with CAD model search and retrieval, in PLM systems or other searchable CAD databases.
Knowledge of statistics.
You have experience analyzing data and communicate results using tools such as Pandas, Matplotlib, Seaborn, Plotly, R or others.
You have knowledge of the design, manufacturing, AEC, or media & entertainment industries.
Experience with Autodesk or similar products (CAD, CAE, CAM, etc.).