Open Source 2025 Notes
Throwback Thursday: Old School Optimization Using Newfangled Machine Learning
Linear Programming
- "Programming" in the sense of scheduling.
- Inequalities carve out regions of space. These are called "constraints".
- Objective function: goal we are trying to maximize.
Linear Optimization Facts
- Solvable when bounded.
- Min-Max duality.
- RPGs: Max strength == min weakness
- Deterministic - guarantees best answer given the input. Is susceptible to garbage input.
When to use
- Making the best choice out of many available while satisfying constraints.
- Resource allocation.
- Matching.
- Routing problems.
- Inventory decisions.
- Best fit for batch decisions under constraints.
Expected Outcomes
Better use of resources.
Stable operations planning.
- Better inventory management
Measured risk, esplainable, and respects your contraints. Can be built quickly with a small team.
Plays nice with forecasted values: making best prediction with the data available.
Any vertex of the feasible region has a cone of normal vectors where it is the optimal solution.
Strength of forecast affects the feasible region.
How to Make One?
- Pyomo functions like an ORM for interfacing with solvers.
- CBC (COIN-OR Branch Cut) to solve.
React at Work Post-Create-React-App
- Provided testing, bundling, and a dev server built-in.
- "Kill it with Fire" book looks interesting.
- Create React App officially deprecated in February 2025.
- React 19 doesn't work well with it.
- React is becoming a framework.
Alternatives
Next.js
Cons:
- Tight couplign of backend and frontend.
- Built-in server runtime
Remix
- Also tightly coupled.
- Gnarly nested routing structure.
Bun
- Really fast bundler.
- Too much magic.
- Incomplete ecosystem compatibility.
Vite
- Fast, flexible, and friendly
- Pretty easy to migrate from CRA.
- Static by default.
- Unbundled dev server, optimized builds.
- Can use existing rollup plugings.
Open Source Tooling and Best Practices to Improve Vulnerability Management
- VM: Vulnerability Management
--Identification -> Reporting -> Evaluation -> Prioritization -> Remediation -- ∧ | | ∨
- Competency trap: people don't use new tools because building competency in a new tool is challenging.
- Mend Renovate: formerly Renovate.
- Renovate bot can work off of dependencies defined in comments in a dockerfile.
Beyond the Chatbot: Delivering Business Value with LLMs
- According to IBM "Institute of Business Value", only 25% of AI initiatives have delivered expected ROI.
- 16% have scaled enterprise wide.
- IBM's Advice: ignore FOMO, lean into ROI.
- When to chatbot:
- Onboarding users (employees/customer) to complex systems.
- When users are lost, confused, or not even sure what they need.
- Need metrics
- conversion rate
- % requests routed to a human
- manual time saved
- response/execution times.
- When not to chatbot:
- Simple forms
- When users are experts
- LLM Assisted Automation (agents) can provide value by performing tasks on triggers.
- Using AI to capture business that you don't have the number of people to take in the business?!?!
- Solving problems with GenAI:
- Classifying unstructured data.
- What kind of document is it? What needs to be done with it?
- Convert unstructured so structured data?
- Parsing quote requests, emails, etc.
- What is Zeiss IQS doing in this space?
- Translate similar data between systems:
- Referencing information from external vendor systems.
- Classifying unstructured data.
- Why was something not already automated?
- Complex SOPs
- Unstructured data
- Parsing information from multiple internal/external services
Measuring GenAI Solutions
LLM Evals
"Answer true or false" is the key.
- Example:
SYSTEM: Act as a metallurgy expert. Do not explain your answer. Answer true or false.
USER: Steel is an alloy of iron and carbon?
Easy to apply to any business domain. Ask experts: what are thoughest questions you're asked? What are the perfect 20 year veteran answers? Then let the expert decide when the AI is trustworthy enough.
Metrics Driven Development
- Answer correctness
- True/false, multiple choice questions
- More complex questions:
- Determine ground truth
- Take intersection of model answers and ground truth and divide by intersection + model + ground truth.
- AnswerIntersections / (AnswerIntersections + Ground Truth + Model Answers)
- Faithfulness - degree to which outputs align with facts
- Relevancy
- Context recall
- Arize Phoenix: LLM evaluation tool
- guardrailsai.com
These tools are generally used to test existing LLM's, not to fine tune or create a model. Most things that are changed are the context that is fed in, or the prompt that is given.
Note posted on Wednesday, May 28, 2025 7:17 PM CDT - link