Senior Data Engineer
Grey Matters Defense Solutions is a specialized firm in software development, data analytics algorithms, and advanced remote sensing technologies, tailored to meet the complex demands of defense and intelligence sectors. Staffed by a diverse range of professionals—from senior-level experts from organizations like the Defense Intelligence Agency (DIA), National Reconnaissance Office (NRO), Defense Advanced Research Projects Agency (DARPA), and the U.S. Armed Forces, to recent graduates and military recruits—the company prides itself on a culture of diversity and inclusion. Grey Matters Defense Solutions sets itself apart with a startup-like corporate environment that values flexibility and collaboration, offering flexible schedules, hybrid work options, and frequent team events and company outings. By integrating the skills of subject matter experts, analysts, software engineers, and data scientists, Grey Matters Defense Solutions delivers unique artificial intelligence algorithms and applications, positioning itself as a leader in innovative solutions for defense and intelligence.
We are seeking a Senior Data Engineer to join our data science group at Grey Matters Defense Solutions. The ideal candidate will work closely with data scientists and machine learning engineers to build and maintain scalable, efficient data pipelines, manage metadata storage, and deploy machine learning models. You will integrate multiple data sources, handle data format conversions, manage tagging tools, and ensure the smooth operation of machine learning workflows. This includes setting up scalable, automated pipelines for data ingestion, preprocessing, and annotation, as well as implementing robust systems for model versioning, deployment, and monitoring.
Primary Responsibilities:
Data Pipeline Development: Build and maintain data pipelines that handle the ingestion, conversion, and processing of raw data into formats suitable for machine learning models.
Metadata and Tagging Management: Implement tagging and annotation tools to enhance data used for model training, ensuring efficient metadata storage and retrieval.
Database Management: Design and maintain databases for storing structured/unstructured data, including tags and metadata.
Preprocessing & Automation: Automate the preprocessing of data (e.g., cleaning, normalization, augmentation) to prepare it for neural network models.
MLOps Integration: Work with machine learning engineers to implement end-to-end machine learning workflows, integrating data pipelines with model training, deployment, and monitoring processes.
Model Deployment and Monitoring: Set up and manage infrastructure for deploying machine learning models, including maintaining inference servers and continuous integration pipelines.
Model Versioning: Implement model version control and management systems to track experiments and ensure smooth transitions between model iterations.
Qualifications:
- Eight (8) years of experience in a Data Engineer role.
- Active TS (Top Secret) clearance.
- Experience with machine learning workflows, including data pipelines and model deployment.
- Familiarity with structured and unstructured data, and with converting both for use in machine learning models.
- Strong understanding of MLOps practices, including model versioning, monitoring, and CI/CD for machine learning models.
- Experience in scaling infrastructure to handle large datasets and multiple models in production.
Skills:
MLOps Tools: Experience with MLOps tools and platforms like Kubeflow, MLflow, or Seldon, including model tracking, deployment, and monitoring systems.
Data Engineering Tools: Airflow, Bash, Docker, Docker Compose, GDAL, Git, Linux, make, MongoDB.
NVIDIA Ecosystem: Expertise in NVIDIA installations (CUDA, cuDNN, drivers, NVIDIA Container Toolkit).
Python: Expertise in Python (Dask, Faker, Jupyter, NumPy, pandas, pydantic, pymongo, pytest).
Data Conversion and Processing: Experience with data conversion and processing libraries such as Rasterio and Ray.
Web & API Development: Experience with Traefik and hosting tools.
Database Management: Strong experience with managing metadata and tagging in databases like MongoDB, with scalable storage solutions.
Salary Range: $165,000 - $200,000 + 25% SEP
Grey Matters Defense Solutions offers a comprehensive benefits package including medical, dental, vision, and life insurance, as well as short-term and long-term disability coverage.
Additional Benefits:
- SEP IRA: 25% of base salary
- PTO: six weeks
- IBA: 12.5%
- Employee assistance program
- Employee discount
- Flexible spending account
- Health savings account
- Referral program
Grey Matters Defense Solutions’ most valuable assets are its more than 60 employees: data scientists, custom software developers, and analysts/subject matter experts, with senior-level personnel formerly from DIA, NRO, NSA, and the U.S. Armed Forces. Our employees' analytical depth gives them a thorough understanding of managing and delivering products within government systems.
Grey Matters Defense Solutions provides transformational leadership, building award-winning teams and products. Join our team of exceptional developers, architects, and data scientists!
Visit us at Grey Matters Defense Solutions
https://www.linkedin.com/company/grey-matters-defense-solutions/
“Know Your Rights: Workplace Discrimination is Illegal”
Questions? Contact: [email protected]