Ian P. Thomas

ian@scraemon.com


Profile


Skills and Tools


Relevant Experience

Sr. Software Engineer - Tech Lead August 2022 - October 2024
  • Built and orchestrated a modern data stack comprised of best-in-class Extract, Load and Transform (ELT) components, running both in Kubernetes and AWS, supplying our Data Lake House;
  • Built, as well as provided expert guidance on how to build, data-driven products leveraging our data stack;
  • Designed and implemented event-driven microservices, developer productivity packages, and a service monitoring framework;
  • Designed and implemented cloud infrastructure using Terraform, AWS Cloud Development Kit (CDK) and Cloud Formation Templates;
  • Mentored junior developers;
Sr. Software Engineer March 2022 - July 2022
  • Designed and implemented the following solutions:
    • Python libraries that facilitated API Authentication; interacting with MS Graph; Application Configuration and auto-generation of Data Classes from MS Graph JSON schema exemplars;
    • Stateless Azure Function App products;
    • The foundations for a complete CI/CD pipeline leveraging GitHub Actions Workflows.
Sr. Software Engineer November 2020 - March 2022
  • Designed and implemented the following solutions:
    • On-premise Elastic Stack on-boarding automation microservice: provided Kibana credentials, Kibana Space, Elasticsearch index pattern; and custom Filebeat and Metricbeat configuration files;
    • JPA implementation-agnostic Spring Boot Persistence starter library that provided a lightweight testing database, audit abilities, and configuration driven soft-delete functionality that leveraged the Reflection API and generated source code compiled and loaded at run-time;
    • Spring Boot Metrics starter library to facilitate transmittal of application run-time metrics to Elastic Stack that provided custom Spring Actuator endpoint functionality that interfaced seamlessly with the Metricbeat HTTP module;
    • Java development environment setup automation script to streamline on-boarding;
  • Collaborated on the design and implementation of an ACME protocol-like SSL certificate life cycle management Spring microservice and Certbot-like client library leveraging the Jetty web server's ability to reload its SSL context in-flight;
  • Initiated, led, and completed the design, implementation and documentation of the organization's internal use of Docker on WSL2 on VMware VMs to facilitate cloud-native application development;
  • Led or coordinated multiple agile swim lanes and mentored junior developers;
Sr. Big Data Engineer September 2019 - October 2020
  • Designed and implemented the following solutions:
    • Automated Clearing House (ACH) file conversion tool that identifies, logs, and removes incomplete records during conversion to delimited flat-files for later incorporation into Hive;
    • Hadoop data comparison tool that automates the identification of even the smallest of discrepancies between very large data sets in separate Hadoop environments in a very efficient manner;
    • Script Build and Installation tools to facilitate sharing of Bash applications;
    • Bash Unit-testing library to facilitate testing, refactoring and maintainability of Bash applications and libraries;
    • Hive, SQLite and Teradata Database Interface (DBI) libraries for Bash to aid in ETL Pipeline verification and testing;
    • Functional Programming library for Bash that eases the incorporation of the most commonly used functional programming functions into Bash applications;
    • COBOL copybook parser and SQL DDL generation tools that automates the generation of COBOL load files, tables and views from a single copybook;
  • Initiated and led numerous knowledge-transfer sessions with colleagues to better understand, and begin documenting, the numerous processes and standards used to create ETL pipelines for the Enterprise Data Warehouse (EDW);
Sr. Data Consultant April 2018 - August 2019
  • Designed and implemented the following solutions:
    • Extensive, multi-data vendor analysis tool that incorporated and normalized disparate vendor pricing models and non-uniform vendor enriched data outputs into a single, coherent cost model with comparable enrichment results, whose output served as the foundation of a presentation I provided to senior management, with recommendations as to which vendor to partner with;
    • PII data quality detection and remediation proof of concept that automated the detection of data value variances by leveraging the IRS's Online Bulk TIN Match program, and generated actionable reports to facilitate remediation;
    • ETL applications that sourced data from Oracle and Teradata, wrangled it in Pandas, and generated custom, actionable reports, in order to either facilitate remediation of data quality issues or demonstrate the value of the bank's data;
  • Led an agile team that was able to deliver a production-ready web-based front-end to the Bank's Master Data Management system in just a few months;
  • Presented to groups, and mentored employees, on how to leverage Python for data analysis, test automation, process automation, and reporting;
Manager of Data Services December 2012 - November 2017
  • Established organization-wide, best-in-class data analysis capabilities by expanding the types of processes the organization was able to perform in-house, from simply data exploration, to data retrieval, preparation, exploration and presentation;
  • Pioneered the use of Platform as a Service (PaaS) to access the legal industry's leading data analysis platform - eliminating both hardware and full-time administrator costs, while retaining complete access to the platform and allowing the full access cost to be passed through to clients;
  • Extended the functionality of existing third-party software through integration with other third-party and custom-built software allowing our clients to realize an average of 15% savings in data analysis costs on large volume data analysis projects;
  • Designed and implemented software to perform the following functions:
    • Statistical analysis of myriad metadata associated with, and extracted from, large volumes of documents;
    • Extraction, transformation and loading (ETL) of data matching complex criteria from both binary and text-based data stores to enable further analysis via a browser-based interface;
  • Comprehensively managed third-party vendor relationships including, but not limited to, defining vendor selection criteria and ongoing performance metrics, negotiating pricing and identifying points of collaboration;
  • Trained attorneys and paralegals to better incorporate and optimize the use of software in their workflows, resulting in increased productivity;
  • Researched new technologies, participated in online forums, and attended conferences in order to stay current with the latest industry technologies including supervised and unsupervised machine learning assisted data review;
Sr. Data Analyst April 2012 - November 2012
  • Designed and implemented software to perform the following functions:
    • Facilitate the export and conversion of voluminous document metadata via a poorly documented jRuby API into a format compatible with our client’s data analysis system;
    • Verify the completeness and correctness of structured and semi-structured data received from third-parties;
Data Analyst June 2011 - April 2012

Lipsitz Green Scime Cambria LLP

  • Designed and implemented software to perform the following functions:
    • Catalog, search and sort the firm’s legal research publication subscriptions via a graphical user interface;
    • Cost analysis of those subscriptions via custom cost reporting;
  • Identification, extraction and tabulation of relevant evidence from large volumes of electronic documents for the purpose of calculating damages;
  • Provided expert guidance on decisions where both legal and technology issues were intertwined; e.g., choice of intra-firm e-mail archival system to facilitate compliance with information governance requirements, and smartphone selection based on typical attorney usage patterns;

Education

Doctor of Law (J.D.)

Master of Environmental Law and Policy

Bachelor of Arts (B.A.) in Computer Science; Minor in Accounting