Will Diamond - AI Engineer

Los Altos, CA, United States | will@diamondquarters.com

LinkedIn

Github

Ask Abe

A

PDF Resume

Download

Profile

I am an AI Engineer with specialization in building full-stack AI applications that utilize LLMs . I have experience in building data pipelines which utilize LLMs to conduct large scale web scraping, data processing, and structured metadata extraction to populate and prepare databases as knowledge bases for RAG. I have proficiency in utilizing Python, Pydantic, PostgreSQL, and other LLM libraries such as Instructor to build prompt pipelines to extract structured data at scale. I have experience building APIs used by AI assitants and generative AI web applications in React and NextJS.

Work Experience

Contract AI Engineer, Contoural Inc.

Nov 2023 — Present

  • Effectively led Contoural's AI Citation Extraction Pilot Project.
  • Designed and implemented the project plan.
  • Developed and maintained production prompt pipelines.
  • Coordinated with domain experts to build effective eval sets.

Happy to provide references on request.

Founder and CEO, Recodify.AI

Aug 2023 — Aug 2024

  • Researched innovative methods to transform legislation into optimal schema for use with RAG and AI agents, and custom tooling/APIs to access it.
  • Created scraping engine technology to automatically extract, process, and prepare primary source legislation for use with LLM pipelines.
  • Developed diverse AI web applications for displaying, chatting with, and understanding primary source legislation for end users.
  • Scraped and processed 47 US States and Federal legislation.

Programming Languages

Experience

Relative experience shown.
See Statistics page for more detailed statistics.

Core Dev Tools & Frameworks

Frontend

NextJSReactShadCNTailwindCSSSeleniumBeautifulSoup

Backend

PydanticInstructorFastAPIPostgres (with JSONB)PsycopgPG-Typed

Interactive Career Timeline

Growing Up In the Bay Area

Los Altos California

I was born and raised in Los Altos, CA, in the heart of Silicon Valley. I was very fortunate to have been raised in a good neighborhood with a loving and supportive family. I was immersed in a culture of innovation and entrepreneurship from a young age. I got good grades at Los Altos High School, played football for 4 years, and took multiple computer science courses.

Discovering my Love of Programming

Scratch

I loved the feeling of solving puzzles and the reward of building things. My first project was a 2D golf simulator built in Scratch using physics equations I had just learned in my AP Physics class! I was hooked from there, and decided to pursue a degree in Computer Science in college.

Undergraduate Computer Science

MSU Engineering

I attended university at Michigan State as an undergraduate CS major. I chose Michigan State because I loved the traditional midwest college experience, and because I got a partial scholarship to go there. There I learned Python, C++, and of course fundamental data structures and algorithms.

Switching Majors to Data Science

NFL Statistics

After taking some Data Science electives, I quickly found my true calling in data science—an area where the practical application of technology to solve real problems ignited my passion. One of my favorite projects was a statistical analysis of NFL statistics and the coefficient of correlation with number of wins. Not that I didn't like the classic algorithms and data structures of CS, but I got to model and solve real world problems in Data Science. And they actually taught us how to use git (Shocker that the CS program hasn't changed in 20 years!)

First Hands on Experience with ML and AI

SVM Results

As I progressed through my data science degree I got to take more classes in AI and Machine Learning, which was just the coolest thing ever to me. I knew I always wanted to work in AI and strongly believed that the hardest and most interesting problems could be solved with AI technology. (Notably this was pre ChatGPT). My capstone project worked with Argonne National Laboratory building ML models to predict moments of high volatility in fluid dynamics datasets. My SVM model performed the best, which our group chose to use for our final results and presentation!

Post Graduation Struggles

I hate leetcode

Post-graduation, the transition from academia to the professional world was unexpectedly challenging.

Despite a robust educational background, the job market proved unforgiving, and repeated rejections soon took their toll, leaving me demoralized and questioning my path. Grinding leetcode was something that I seriously struggled with. I became disillusioned with the entire process, feeling like I was wasting my time and making no progress. It was during this period of introspection that I decided to take a step back.

I spent a year coaching JV football at Palo Alto with my twin brother, which was a unique and fun experience I'm glad I did. After the season I returned to my old college job of bartending in downtown Mountain View, in order to make ends meet and help pay my way as I was living at home with my parents.

A Lifechanging Act of Kindness

Bartending at scratch

Disillusioned with leetcode grinding and depressed from the college grad job search, I found myself working 40-50+ hours bartending and wondering how I could start my career.

The turning point came from an unexpected source—a chance encounter at the bar with a random person at my bar one night. His name was Sean, and he was unexpectedly kind: asking about my education and eventually career goals. Sean was also a programmer, with a lot of software engineering experience.

He asked me if there was any idea or dream I wanted to work on or build: to which I said yes. For 5 years I've had the idea to build some kind of aplication to help regular people understand the law and the rights that apply to them. On the spot, Sean invited me to come code with him and some friends that Sunday at a local coffee shop. This group, meeting every Sunday to hack/collaborate on projects and share ideas, was named Sunday Hustle.

A Second Chance

Commits after joining Sunday Hustle

I seriously debated if I wanted to go, but reason took over and I realized it could be the spark I needed to restart my efforts to break into tech and get a job. It was the best decision I ever made. I met a group of passionate, kind, and experienced software engineers and AI enthusiasts. Above you can see my github contributions for 2023. It's very clear when I started going to Sunday Hustle.

Sunday Hustle

Commits after joining Sunday Hustle

I started regularly going to Sunday Hustle every week. At my first Sunday Hustle, I didn't know what to work on. I felt self-conscious of my coding ability and thought that I needed to spend some time doing leetcode to get back to speed.

Sean immediately told me this was a dumb idea (something I love about him lol). This is where I first got introduced to the hacker and builder mindset, something I am proud to say I have today. Sean was of the strong opinion that learning by doing was the optimal approach. After hearing my ideas for my dream project (which had no name), he suggested I look into implementing LLM technology and pointed to me something called "vector embeddings" and the pgvector library for Postgres.

Above is the picture of my second ever Sunday Hustle with Sean

Sunday Hustle Pt 2

Sunday Squad

I learned so much so fast, surrounded by great people. I felt so incredibly lucky. I readily accepted the hacker/builder mindset.

I got so much experience using LLMs by building, in the very early days of the technology. (Pre GPT-4 if you can imagine!)

I spend so much time talking about Sunday Hustle because it's been so important to me. A lot of the older members, especially Sean, became not just friends but people I consider mentors to me. Sean, ever humble (almost to an annoying degree), to this day claims his role in helping me get back on my feet is negligible, although I attribute his kindness as the reason I'm on my journey to becoming an AI Engineer today. Below is a picture of the group, photo creds Mark.

Building My Dream Project

SQL table for California statutes

Every Sunday marked an improvement in my capabilities as a software engineer and an increase in my experience using LLM technologies. My dream legal help project started to take shape into a fully functional LLM chatbot. The goal of the project was to democratize access and understanding of legal knowledge to all.

A user could ask simple questions to the chatbot, which would search a database of scraped legislation (more on this feat later) to find relevant legislation. It would give legal education and information (never advice, don't sue me) in a format understandable to all. Included in its answer would be direct citations to the legislation, as well as the ability to view exact text and visit the official .gov website.

Ask Abe - My First Full Stack AI Application

Ask Abe Website

I gave my LLM the persona of Abraham Lincoln, always honest and truthful. Building trust in LLM responses was a big deal back then, and still remains a problem today in critical domains such as legal.

I learned so much about embeddings, similarity search, rag pipelines, optimizations to user queries with LLMs, chain of thought reasoning and prompting, and limitations to LLM technology. Things like retries, LLM rate limits, cost/accuracy optimization, and evals were all completely new concepts. The myriad of LLM errors in a complex system like this was equally difficult and fun. It felt like solving puzzles within puzzles, where the inner workings of LLMs and the process of optimizing responses felt half science half magic.

I loved it.

Eventually, I went to a Stanford LLM hackathon to showcase Ask Abe. I met so many friends and connections that helped me make my passion project into a career. I quit my job bartending, and started working full time on my passion project.

Starting Recodify.ai

SQL table for California statutes

I founded my first startup based on the work I had been doing with Ask Abe. The idea was to build a platform to help AI Engineers build LLM applications in the legal field. I asked my data science friend from MSU, Madeline, to be my cofounder. We would provide extensively processed primary source legislation data and tools to streamline the difficult task of building applications. And to start out we would offer our consulting services to help companies build AI applications in the legal field using our IP.

The core of our business was providing access to an SQL database of global scraped legislation. In the process of building Ask Abe, the most time consuming process was finding accurate and usable source data for RAG. I looked far and wide for open source legislation data and found nothing. There were many open source platforms for finding judicial court cases and decisions, but nobody could provide the actual legislation that serves as the basis for all of our laws and regulations. Any AI application built in the legal field is almost certainly built only with this judicial data, or serves to help lawyers and attorneys conduct their work more efficiently. The rare few legal AI applications which could analyze legislation data were owned and operated by the big players in this game: CaseText, LexisNexis, and Thompson Rheuters.

Democratizing Legal Knowledge

SQL table for California statutes

These big players had such a monopoly on these AI applications because they had a monopoly on the data. We aimed to remove this barrier, level the playing field, and provide all the data and tools an AI Engineer would need to start building instantly. We wanted to democratize legal knowledge to allow for the construction of more powerful and wide spread AI applications.

This startup alligned with the mission of Ask Abe, and the whole reason I started building this platform and technology in the first place.

Open Sourcing My Work

SQL table for California statutes

Finding product market fit was difficult, not to mention the technical and practical implications of offering up the world's legislative data from an API. After a lot of debate and hard thinking, I decided to open source my work and shut down the startup to focus on my career as a software engineer and AI Engineer.

Through my work with recodify I was able to get contract AI Engineering work for Contoural, utilizing the IP and my experience to build LLM applications with real business value. (Users for my AI applications? Crazy!)

AI Engineering Work with Contoural

SQL table for California statutes

Because of our extensive IP of scraped and processed legislation, we were able to get our first client: Contoural

Contoural is an information governance consulting company which does extensive work consulting clients on how to handle large amounts of data to maintain legal and regulatory compliance.

Building an LLM System to Extract Recordkeeping Requirements

SQL table for California statutes

We were brought on to Contoural to build an AI system for extracting Recordkeeping Requirements with higher accuracy and quality metadata than anything on the market (very niche market)

This complex AI system pushed the boundaries of LLM capabilities for reasoning and structured data extraction in the legal field.

I gained so much experience building production ready LLM applications. Creation of extensive LLM eval sets by domain experts was the most valuable experience gained.

Building LLM Tools with Immediate Business Value

AI Citation Helper

Besides working on the main Recordkeeping Requirement AI Extraction project, I also had opportunities to interview and shadow employees to better understand their work flows. I diagnosed a couple of areas of work where LLMs could be incredibly useful for increasing efficiency.

One of these LLM applications I built took in large amounts of legal citations data to pre-sort and classify each citation's relevance to a given client.

This self contained LLM application was able to reduce an attorneys time to complete this kind of soul crushing legal research and analysis work by 75%. This not only saved Contoural a lot of expensive domain expert billable hours, but significantly reduced some of the most hated work done by these domain experts.

My Numerous Side Projects

AI Citation Helper

Throughout my journey as an AI engineer, I have worked on numerous side projects. Full transparency, there's a lot of half finished implementations! (A common problem)

I learned to love building. Be it NextJS websites (love/hate sometimes), custom AI engineering tools, or experimentations into LLM capabilities. The spirit of Sunday Hustle's "build something and demo it" lives on.

You can view more of my projects in more technical depth at the link below. I will continue to build, ship, and demo. Who knows what I'll build next Sunday!

The Future?

AI Citation Helper

Although I loved founding Recodify.ai, and building LLM applications for Contoural, I've always felt I hadn't reached my potential. I'm currently looking for full time employment in software engineering and AI Engineering.

I want to work on interesting and difficult projects.

I want to work in a fast paced environment, BUILDING! Shipping fast, and contributing as much as possible.

I want to be a part of a team building great things. Surrounded by passionate, experienced (cracked), and motivated people.

I want to work somewhere I can learn as much as possible by doing.