David Vedvick

Notes

Open Source 2025 Notes

Throwback Thursday: Old School Optimization Using Newfangled Machine Learning

Linear Programming

  • "Programming" in the sense of scheduling.
  • Inequalities carve out regions of space. These are called "constraints".
  • Objective function: goal we are trying to maximize.

Linear Optimization Facts

  • Solvable when bounded.
  • Min-Max duality.
  • RPGs: Max strength == min weakness
  • Deterministic - guarantees best answer given the input. Is susceptible to garbage input.

When to use

  • Making the best choice out of many available while satisfying constraints.
    • Resource allocation.
    • Matching.
    • Routing problems.
    • Inventory decisions.
  • Best fit for batch decisions under constraints.

Expected Outcomes

  • Better use of resources.

  • Stable operations planning.

    • Better inventory management
  • Measured risk, esplainable, and respects your contraints. Can be built quickly with a small team.

  • Plays nice with forecasted values: making best prediction with the data available.

  • Any vertex of the feasible region has a cone of normal vectors where it is the optimal solution.

  • Strength of forecast affects the feasible region.

How to Make One?

  • Pyomo functions like an ORM for interfacing with solvers.
  • CBC (COIN-OR Branch Cut) to solve.

React at Work Post-Create-React-App

  • Provided testing, bundling, and a dev server built-in.
  • "Kill it with Fire" book looks interesting.
  • Create React App officially deprecated in February 2025.
    • React 19 doesn't work well with it.
    • React is becoming a framework.

Alternatives

Next.js

Cons:

  • Tight couplign of backend and frontend.
  • Built-in server runtime

Remix

  • Also tightly coupled.
  • Gnarly nested routing structure.

Bun

  • Really fast bundler.
  • Too much magic.
  • Incomplete ecosystem compatibility.

Vite

  • Fast, flexible, and friendly
  • Pretty easy to migrate from CRA.
  • Static by default.
  • Unbundled dev server, optimized builds.
  • Can use existing rollup plugings.

Open Source Tooling and Best Practices to Improve Vulnerability Management

  • VM: Vulnerability Management
  • --Identification -> Reporting -> Evaluation -> Prioritization -> Remediation -- ∧ | | ∨

  • Competency trap: people don't use new tools because building competency in a new tool is challenging.
  • Mend Renovate: formerly Renovate.
  • Renovate bot can work off of dependencies defined in comments in a dockerfile.

Beyond the Chatbot: Delivering Business Value with LLMs

  • According to IBM "Institute of Business Value", only 25% of AI initiatives have delivered expected ROI.
    • 16% have scaled enterprise wide.
    • IBM's Advice: ignore FOMO, lean into ROI.
  • When to chatbot:
    • Onboarding users (employees/customer) to complex systems.
    • When users are lost, confused, or not even sure what they need.
  • Need metrics
    • conversion rate
    • % requests routed to a human
    • manual time saved
    • response/execution times.
  • When not to chatbot:
    • Simple forms
    • When users are experts
    • LLM Assisted Automation (agents) can provide value by performing tasks on triggers.
    • Using AI to capture business that you don't have the number of people to take in the business?!?!
  • Solving problems with GenAI:
    • Classifying unstructured data.
      • What kind of document is it? What needs to be done with it?
    • Convert unstructured so structured data?
      • Parsing quote requests, emails, etc.
      • What is Zeiss IQS doing in this space?
    • Translate similar data between systems:
      • Referencing information from external vendor systems.
  • Why was something not already automated?
    • Complex SOPs
    • Unstructured data
    • Parsing information from multiple internal/external services

Measuring GenAI Solutions

LLM Evals

"Answer true or false" is the key.

  • Example:

SYSTEM: Act as a metallurgy expert. Do not explain your answer. Answer true or false.

USER: Steel is an alloy of iron and carbon?

Easy to apply to any business domain. Ask experts: what are thoughest questions you're asked? What are the perfect 20 year veteran answers? Then let the expert decide when the AI is trustworthy enough.

Metrics Driven Development

  • Answer correctness
    • True/false, multiple choice questions
    • More complex questions:
      • Determine ground truth
      • Take intersection of model answers and ground truth and divide by intersection + model + ground truth.
        • AnswerIntersections / (AnswerIntersections + Ground Truth + Model Answers)
  • Faithfulness - degree to which outputs align with facts
  • Relevancy
  • Context recall
  • Arize Phoenix: LLM evaluation tool
  • guardrailsai.com

These tools are generally used to test existing LLM's, not to fine tune or create a model. Most things that are changed are the context that is fed in, or the prompt that is given.

Note posted on Wednesday, May 28, 2025 7:17 PM CDT - link