I’ve put on a bit of weight over the last couple of years despite eating less and working out more*Yes, I’m 39. It happens - but I’ve legitimately run marathons, deadlift > 2x my increasing bodyweight, and eat clean and healthy. Something is wrong in the calculus of my waistline.. My wife (who is an NP) had the epiphany that it’s probably the anti-anxiety meds I’ve been on for…about two years. Cut to me on the phone to my GP*PCM for Americans about it on a Friday afternoon, headphones in, single-crewed-parenting our son Arthur on a sunny day.
Arthur grabs my hand and drags me out the front door because the ice cream van has just turned up. (The ice cream van comes every day, and negotiations about getting ice cream happen every day.)
I’m midway through telling the doctor my diet is genuinely excellent and I do exercise, and she’s listening patiently*whilst I have the unfounded suspicion that she doubts me. The ice cream van starts pulling away before we get to it and I bellow, at the top of my lungs, “COME BACK MR ICE CREAM VAN MAN.” Then I get back on the phone and carry on telling the doctor my diet is excellent.
I’m sharing this partly because it’s about to become load-bearing for everything that follows, and partly because it’s the life event that set all of this in motion. Mostly, though, I’m sharing it because it’s a genuinely funny story and I know I need to tell it.
We (me, wife, doctor) figured out that the anti-anxiety meds were probably treating the wrong condition; what I had looked more like ADHD showing up as anxiety. It matters here because it forced me to understand how my brain works, and how it doesn’t. Seeing the effect of Adderall has made it easier to notice the pattern in the work my subconscious has been chewing on, in various forms, for the last decade - and how it led to Flexcompute Thread.
Fundamentally, I’ve realised that my career has included one long attempt to avoid pointless data entry. Not because I’m above it, but because data entry is boring, humans are error-prone, and there’s almost always a better way. I love Autofill on my iPhone - especially because we move a lot as a military family and I’ve had four zip codes full of 6s, 0s, and 1s, and I find it hard to remember which was which on forms.
In engineering systems, it’s annoying because data and its relationships to other data have a shape*that should be datums and relationships to other datums, perhaps. I don’t know, I’m not a dictionary.. That shape got lost somewhere between two tools, or two teams, or two formats, and a human got handed the reconstruction job - losing the shape and the context at the same time. This is what’s going on when you find yourself typing the same thing into a second box, or hard-coding a constant, or exporting to a file and re-importing it into another tool, or filling out your name and telephone number again on a repeat visit to the doctor.
There’s a smaller, more personal version of this realisation that I had years before any of it got technical. I’ve always tried to automate tasks that I knew didn’t interest me — because boredom produces errors, and because the challenge of figuring out an automation is fun. Data entry is the worst case: both boring and error-prone. Richer data*by which I mean data that contains more information than you see in any particular view is the opposite. Richer data is more interesting, and when there’s structure, wrong answers look and feel wrong, because they break the structure - which you can only notice when you can observe all the data together. By contrast, if you typo a number into a CSV or an Excel spreadsheet, the mutated data is just as happy as it would have been with the right number*not to mention that the manual transfer makes it out of date the moment you do it.
My ADHD diagnosis didn’t tell me that. It just made it obvious to me: avoiding boring error-prone work hadn’t been laziness*why copy some numbers into a spreadsheet when I can spend twice as long automating it?, it had been my brain’s natural response to a class of work that uses none of the things my brain is good at.
This is a post about that pattern, and my personal realisation. It ends, for now, at a thing called Flexcompute Thread - and gives a glimpse of why it’s magical. There’s also a Part 2/sister article about why AI agents are ideal for generalists.
I am also aware that this writing is long and not everyone*or anyone, barring my father (thanks, Dad) will read it in full. The act of writing this and publishing it is really for me, because it’s helping me to get my thoughts out and forcing me to structure them.
The instinct in plot form
Why I sent the dashboard, not the deck.
I was annoyed during my PhD at having to embed PNG figures into a PDF, and my small act of ‘rebellion’ to that fact was to spend weeks learning how to use PDF_TEX and LaTeX to ensure that typography matched the rest of the document, and that captions and annotations in figures were text-searchable*Me: Why wouldn’t this be a good use of my time?. At ARA*where there’s actually a wonderful data graphing system for live data written in Fortran, from the 1980s, I battled the CTO*We’re friends now, thanks Peter. about why I had to use Microsoft Word when it’s clearly the wrong tool for engineering reports. At IIT, I wrote and proselytised with scripts to automate grade submission to the Senate*in lieu of hand-submitting via a form, which every other faculty member was happy with. At Aurora, I authored flight-dynamics studies and was expected to put the results into PowerPoint. I refused. Not dramatically; I just kept building Plotly + Dash dashboards and sending those round when people asked for the slides. They could interrogate the data without using me as an intermediary.
I’d send both. The deck got circulated; the dashboard got bookmarked by three people and forgotten.
I realise now that my recalcitrant objection was philosophical and topological*Not sure I have to phrase it like such a wanker, though. Data has a shape — a graph of relationships between inputs, outputs, regimes, edge cases — and a PowerPoint slide is a flat projection that loses everything that makes data rich. You can project any data onto rectilinear axes the same way you can project a globe onto a flat map. You cannot reverse the projection, and you cannot recover what it lost.
This is the first instance of a pattern that’s about to keep happening: the topology-preserving output is the one that keeps the data alive, and the rectilinear output is the one that most systems - human beings included - can absorb.
The gap between those two outputs, and what a crude transfer across it throws away, is the whole post.
Paradigm
Two coupled systems, one model — and a prize that didn’t move the system.
The bigger version of this, the one that mattered most professionally, was a project called Paradigm*I tried to shoehorn a backronym for Predicting Aviation Routes and Decision Intelligence for Global Mobility, but even I had to accept that it was contrived and that I just wanted the cool name of Paradigm. I had been at Aurora for a couple of years and got seconded to Boeing’s Sustainability and Future Mobility team, working on Future Aircraft Conceptual Design. I was brought in to do Stability and Control Estimates*I actually think/know that Dom wanted me to be there to do whatever it is I’m good at…which at the time he called Data Storytelling. and finished that quickly, but kept asking: “how do we know our requirements are right for the design of something that’s never been designed before and doesn’t fit into the niches that currently exist in aviation?”
Boeing has a tool called Cascade, built by the sister team of the one I was on. Mathematically clean, built by people who knew what they were doing, and it lets you ask things like if we replaced this fleet of 737s with this hypothetical hydrogen aircraft, what happens to global aviation emissions? — over real flight data, at planetary scale. What it does, mathematically, is swap aircraft on routes and recompute. Hand it a fleet plan, get the emissions consequence. It answers that question well, and gives answers that people understand.
The question I kept getting stuck on, which Cascade didn’t answer, was the one underneath: which aircraft can fly from which airports? A hydrogen 737 can’t land where there’s no liquid hydrogen. A battery-electric regional can’t operate without charging. These factors affect the design requirements of next-gen aircraft in a way that they don’t for kerosene-powered aircraft. The bottleneck on aviation decarbonisation isn’t only fleet renewal; it’s the discrete, capital-constrained, sequenced problem of upgrading the network the fleet flies on.
My conjecture was that the sequence in which airports were upgraded would change the impact that new aircraft could make on emissions, and accordingly, the most efficacious upgrade sequence would determine the best design requirements for future aircraft.
So I built Paradigm. The reframe was small in words and large in consequence.
Optimise aircraft alone, you get plans that depend on infrastructure that doesn’t exist. Optimise infrastructure alone, you get upgrade lists with no connection to which aircraft will benefit. Coupled, the problem becomes: given a budget, a timeline, a fleet roadmap — which airports do you upgrade, in which order, to maximise CO2 reduction?
Which is to say: it’s a problem well-suited for MILP. Mixed-integer linear programming. Decades-old, well-understood by the OR*Operations Research community, solvable to optimality with off-the-shelf solvers if you formulate it sensibly - these solvers were originally designed to solve things like The Travelling Salesman Problem*The canonical combinatorial optimisation chestnut: given a list of cities and distances, find the shortest possible tour that visits each once and returns home.. Binary variables for whether each airport is upgraded; binary variables for whether each route is eligible (a route is eligible only if both endpoints are upgraded); a budget cap; an objective that maximises CO2 decarbonised across all eligible routes, using actual annual per-route CO2 from real flight data, not a great-circle proxy. Sequencing makes it interesting — you upgrade in stages, each with its own budget, and once an airport is upgraded it stays upgraded — so the stages chain into one combined MILP rather than a greedy step-by-step. Greedy is the obvious wrong answer; it’ll happily spend stage one on the highest-CO2 airport in isolation and miss that stage two wanted that airport’s neighbour first.
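To make the coupling and the greedy-vs-joint point concrete, here’s a toy sketch in pure Python. The airports, costs, and CO2 figures are all invented, and brute-force enumeration stands in for a real MILP solver - but the structure is the one described above: routes only count when both endpoints are upgraded, and a stage-by-stage greedy plan can lose to the combined optimisation.

```python
from itertools import combinations

# Toy instance - invented costs and CO2 figures, not real data.
cost = {"A": 1, "B": 1, "C": 1, "D": 1, "E": 1}   # upgrade cost per airport
co2 = {("A", "B"): 10, ("C", "D"): 6, ("C", "E"): 6, ("D", "E"): 6}
STAGE_BUDGET = 2
STAGES = 2

def eligible_co2(upgraded):
    # The coupling: a route counts only if BOTH endpoints are upgraded.
    return sum(v for (a, b), v in co2.items() if a in upgraded and b in upgraded)

def affordable(candidates):
    # All subsets of `candidates` that fit within one stage's budget.
    return [set(c) for n in range(len(candidates) + 1)
            for c in combinations(candidates, n)
            if sum(cost[a] for a in c) <= STAGE_BUDGET]

def greedy():
    # Spend each stage's budget on the locally best set of new upgrades.
    upgraded = set()
    for _ in range(STAGES):
        remaining = [a for a in cost if a not in upgraded]
        upgraded |= max(affordable(remaining),
                        key=lambda s: eligible_co2(upgraded | s))
    return eligible_co2(upgraded)

def joint():
    # Brute-force the combined problem: choose both stages together,
    # maximising CO2 on routes eligible at the end. (A simplification -
    # a real model would accumulate per-stage benefit over time.)
    best = 0
    for s1 in affordable(list(cost)):
        for s2 in affordable([a for a in cost if a not in s1]):
            best = max(best, eligible_co2(s1 | s2))
    return best

print(greedy(), joint())  # greedy banks A-B first and forfeits the C-D-E triangle
```

On this instance greedy grabs the big A-B route in stage one and can never afford the whole C-D-E triangle; the joint plan, like the chained MILP, sees both stages at once and takes the triangle.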
Later on, under advisement from a specialist smarter than me*Thanks Jeremy Harris., I moved to CP-SAT, which enabled a LOT more that I won’t bore you with.
The really fun thing*if you’re a massive nerd like me is that it proved the conjecture I had. Aircraft payload and range determine which routes a hypothetical aircraft can fly, which determines which airports matter to upgrade, which determines which aircraft designs make sense to develop. Run the loop both directions, jointly, and you get answers neither half gives alone. How this fed Boeing’s aircraft strategy is left as an exercise for the reader*I have presented the method in detail publicly, at academic lectures, but I don’t want to spell out how to do all this in one place - but my still-unproven thesis is that there’s a danger a company introduces a new aircraft that requires international infrastructure that inadvertently allows a competitor to flourish using design requirements optimal for that airport set..
The cost predictors weren’t a constant per airport either*after the proof of concept. The infrastructure cost was a complex set of models based on distance to existing infrastructure, water availability, renewables proximity — engineering-derived predictors from public data, turned into per-airport upgrade-cost estimates. Honestly, this was a whole load of fun - solving problems using geospatial algorithms, routing (for land and air vehicles), and large-scale search trees.
Paradigm formed the backbone of my Associate Technical Fellow (ATF)*Not Alcohol, Tobacco, and Firearms - though that would be cool. application for Boeing. I had accomplishments in traditional engineering fields at Aurora and from previous employers, but Paradigm really felt like something powerful - if you got it then you understood that it could change how we picked which aircraft to build. The tagline could be summarised as “stop drawing pictures of cool-looking future aircraft…start figuring out what sort of aircraft we should draw pictures of.”
Paradigm needed an organisation that could reward and acknowledge cross-domain work, and Boeing’s structure, whilst great at building aircraft, was not set up to celebrate that at the time. Paradigm won the No-Bull Prize at the international Crew and Fleet Optimisation Workshop*CoW Workshop…COW, No-Bull, Nobel…geddit? Sigh. hosted by Jeppesen in 2025, credited as the work presented that was “most economically valuable to Boeing.” A mock reviewer in the rehearsal panel for my Associate Technical Fellow interview called it “revolutionary,” and told me I was a shoo-in for ATF.
The ATF panel didn’t see demonstrable impact to Boeing and gave me a “sorry, no”, and around the same time the project got dropped*My manager, Dom Barone, always saw the value and gave me aircover, for which I am eternally thankful - but it could only last so long.. I was furious and heartbroken. It took me a while to realise the system, not the work, was the problem - and it was hard not to take this as a personal failure.
The bit I want you to take from Paradigm isn’t the equation. It’s the same complaint as the PowerPoint deck and the doctor’s-office form, scaled up. Two coupled systems — aircraft and airports — were being analysed separately, and someone was meant to manually reconcile the results. The MILP just put the coupling in the model so a human didn’t have to hold it - a job that, compared to a solver built for the purpose, a human is laughably shit at.
That sentence is the whole project.
Twenty years of trying not to be the human join operation
Trying, mildly annoyed, to put graph-shaped data somewhere a bit less wrong.
The plot stuff and Paradigm are the visible bits. Most of the work was invisible.
Twenty years of disciplined infrastructure work, no moments of insight, just one migration after another to find a less-wrong place to put graph-shaped data. Pickles, then HDF5, then SQL, then Postgres with proper foreign keys and recursive CTEs for the genuinely graph-shaped queries. None of it was a bright idea. It was being mildly annoyed at the current storage layer for not being shaped like the data, and migrating to the next thing that was a bit less wrong. *Thanks Jeremy Harris for explaining, calmly, that pickles weren’t a database and never had been. “You’re abusing pickles” is a phrase that, without context, is hilarious.
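As a flavour of what the database treatment buys: a recursive CTE turns “walk the graph upstream” into one query. This is a minimal sketch - the schema and names are invented, and sqlite3 stands in for Postgres because it ships with Python; the CTE syntax is essentially the same in both.

```python
import sqlite3

# Illustrative schema, not anyone's actual one: artefacts, and which
# artefact each one was derived from.
db = sqlite3.connect(":memory:")
db.executescript("""
    CREATE TABLE artifact (id TEXT PRIMARY KEY, kind TEXT);
    CREATE TABLE derived_from (
        child  TEXT REFERENCES artifact(id),
        parent TEXT REFERENCES artifact(id)
    );
    INSERT INTO artifact VALUES
        ('geom_v3', 'geometry'), ('mesh_12', 'mesh'),
        ('run_47', 'cfd_run'), ('cl_alpha', 'derived_curve');
    INSERT INTO derived_from VALUES
        ('mesh_12', 'geom_v3'), ('run_47', 'mesh_12'),
        ('cl_alpha', 'run_47');
""")

# "Where did this come from?" as one recursive query, not archaeology.
ancestors = db.execute("""
    WITH RECURSIVE lineage(id) AS (
        SELECT parent FROM derived_from WHERE child = ?
        UNION
        SELECT d.parent FROM derived_from d JOIN lineage l ON d.child = l.id
    )
    SELECT id FROM lineage
""", ("cl_alpha",)).fetchall()

print([a for (a,) in ancestors])  # the full upstream chain of cl_alpha
```

The point isn’t the syntax; it’s that the relationship is stored once and the question becomes a query rather than a dig through folders.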
The reason I’m spelling it out at all is that I don’t want anyone reading the rest of this post to think I’m claiming exceptionalism - quite the opposite. My understanding of infrastructure caught up to my instinct slowly, and I had many wrong steps along the way. The instinct was visible all along; the supporting tooling was the bit that took the years.
If I had to put the through-line on a t-shirt: I have spent over a decade trying not to become the join. The join between two formats. The join between a database and a paper form. The join between a fleet model and an airport model. The join between an analysis tool and a slide deck. Every time the system can’t connect two things itself, a person has to. That person is an engineer who usually has to do this in lieu of their actual job, and it’s the mundane bullshit that you’re not taught in an actual engineering degree*Case-in-point, my most watched YouTube video is “How to loop over files in a directory using Python”.…because it’s not engineering, it’s administration.
The frustrating bit is that the tooling for not-being-the-join existed for most of those twenty years. Foreign keys are not new. Graph databases are not new. Provenance tracking is not new. What was missing wasn’t the technology; it was anyone willing to sit in the gap between the engineers who had the data and the database people who knew how to store it properly. The OR community had MILP. The CFD community had file formats. Neither group routinely picked up the other one’s tools, so the engineer in the middle ended up doing the picking up — which is, again, the same complaint. Two systems that should be coupled aren’t, and a human ends up holding the coupling. Every time I migrated a storage layer, it was because nobody else had decided that the engineering data deserved the database treatment that the rest of the world had been using for decades. It wasn’t a frontier. It was just a gap nobody had stepped into.
Flexcompute
Where the instinct stopped being a private objection and became the brief.
Vera Yang is the president of Flexcompute. She somehow found me through one of my interactive flight-dynamics explainers — the Plotly-not-PowerPoint instinct, in a LinkedIn post — and decided I should be at her company.
Two organisations with different filters, looking at the same instinct in me, seeing different things. One graded the work on whether it could be priced into a deliverables list; the other graded it on whether it looked like the kind of thinking they wanted more of. This isn’t a slight on Boeing - they’re a big and successful company for a reason.
So - thank you, Vera. And thank you, Dom - for keeping the project alive when the system around it wasn’t built to reward that kind of work.
Thread
The view is a projection. The connections are the truth.
Here’s the bit I didn’t see for a long time. The answer was in my subconscious the whole time — the network structure, the database theory, the data richness all come together. I just didn’t have the framing to name it until I was sitting in the middle of building the next thing.
Thread is what I work on now. The technical pitch is three lines.
- Simulation and modelling data is connected.
- Every result depends on inputs, which depend on prior results, which depend on prior inputs, all the way back to the original geometry and meshing decisions. The connections aren’t an analogy — they’re literal. This number came from that run, which consumed those settings, which themselves were the output of some earlier step.
- Each link is specific and recoverable, in principle.
The problem is that almost no one stores it that way. Excel and pandas and CSVs and folders-of-PNGs give you the illusion of connectivity — the artefacts look related, share filenames — but the actual relationships live in someone’s head, or in a spreadsheet so complex it verges on AGI*The Pricing Spreadsheet at ARA had so many compounded formulas that it would look like a train station clackerboard when you did anything., and walk out the door at six o’clock.
Watch what happens on most teams when someone asks a straightforward traceability question:
You get engineering archaeology. Someone digs through three project folders, finds a spreadsheet from a previous version, opens a screenshot whose filename references a deleted directory, and reconstructs — by hand, badly — what should have been a single query. The reconstruction job is the data-entry job. The structure was there once. It got lost in the flattening, and the data went stale the moment it was copied.
The clean version of the failure looks like this. An engineer has a pandas DataFrame open. The columns are run IDs, lift coefficients, drag coefficients, a comment column, maybe a timestamp. Someone in another meeting asks where one row came from — what mesh, what solver settings, what geometry version, what upstream study fed it. The DataFrame doesn’t know.
If you do manage to pull all of that information into a spreadsheet, and you’re using it to compare different simulation models, you can end up with accidental misalignment - where an Excel lookup or a pandas where confidently tells you that your panel method has a mesh, that your CFD results have an AIC, or that a data row was produced by one person when it may have been many. This may seem like a trite not-really-a-problem problem, but if you don’t do these things properly (and no-one really does), you end up in the mess that resonated when I spelled it out a couple of months ago in Why CFD is Stuck in the File Era.
Stored properly — rather than scattered across a directory of files with a naming convention nobody can quite remember — you get provenance and associativity for free. Where did this number come from and what else came from the same source both become single queries instead of three days of detective work. The flat table view people actually want to read still works. You can see, at a glance, which cells in a row are genuinely related, and which are sitting next to each other only because they share a row label, and so merely look comparable in the view you’ve chosen.
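Here is a deliberately tiny in-memory sketch of those two queries - the names are invented and this is my illustration, not Thread’s actual API. Once the edges are stored, both questions are short graph walks rather than detective work.

```python
# Hypothetical provenance edges: each artefact -> its direct inputs.
parents = {
    "cl_row_47": ["run_47"],
    "run_47":    ["mesh_12", "solver_cfg_2"],
    "run_48":    ["mesh_12", "solver_cfg_3"],
    "mesh_12":   ["geom_v3"],
}

def lineage(node):
    # "Where did this number come from?" - every upstream ancestor.
    seen, stack = [], list(parents.get(node, []))
    while stack:
        n = stack.pop()
        if n not in seen:
            seen.append(n)
            stack += parents.get(n, [])
    return seen

def siblings(node):
    # "What else came from the same source?" - shared direct inputs.
    mine = set(parents.get(node, []))
    return [k for k, v in parents.items() if k != node and mine & set(v)]

print(lineage("cl_row_47"))  # every upstream ancestor of that row
print(siblings("run_47"))    # run_48 shares mesh_12
```

The DataFrame in the story can’t answer either question because the edges were never stored; once they are, the flat table becomes one view of the graph instead of the only record of it.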
The view is a projection. The connections are the truth. The flat table is a perfectly good thing to look at, and Thread will render it for you — without quietly throwing the structure away to do so. You can ask it questions a flat table can’t answer.
The same trick lights up on a plot. A scatter chart is a projection of two columns; the rest — which mesh, which solver, which run, which actor, when — has nowhere to go on the page. In Thread it stays attached to the dot. Hover the dot, ask the question, get the answer that wasn’t on the chart. And because each plot ships with a QR code and the Thread URL embedded in the PNG metadata, the chart can wander into a deck or a memo or a Slack thread and the connection back to the data still works.
That frees the user from the rectilinear constraints of pandas, polars, Excel, CSV - the original sin of computational workflows. The format you ship results in is the format you end up thinking about results in, and CSV in particular has been quietly limiting what people are willing to ask of their own data for thirty years.
There’s a second thing this buys you, harder to demonstrate in a screenshot but the bit users notice after a week of using it. What changed becomes a real question with a real answer. Two runs differ — and instead of squinting at numbers and filenames to guess why, you get why directly. That kind of query was technically possible before, in the same way that recovering the structure manually from a stack of CSVs is technically possible — but the cost per question was too high. Drop the cost per question and people start asking better questions, and that’s most of the value.
The bit that lands harder than the question, though, is the inverse: when something upstream changes, every plot that depends on it gets flagged stale, automatically, in every place it’s been used. The structures group reinforces a wing spar; the inertia tensor moves; some short-period-frequency plot from three weeks ago, sitting in two slide decks and a memo, is now answering the wrong question. With Thread holding the connections, that’s not “did anyone notice” — that’s a notification, before the next review, on the plot itself.
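The wing-spar example can be sketched the same way - hypothetical names, and a tiny stand-in for what a real system would do. Invert the dependency edges, and “flag everything downstream” is a plain traversal.

```python
from collections import defaultdict

# Hypothetical dependency edges: each artefact -> its direct inputs.
parents = {
    "inertia_tensor":    ["mass_props_v2"],
    "short_period_plot": ["inertia_tensor", "aero_deriv_run_9"],
    "deck_slide_14":     ["short_period_plot"],
    "memo_fig_2":        ["short_period_plot"],
}

# Invert the edges: parent -> everything built from it.
children = defaultdict(list)
for child, deps in parents.items():
    for dep in deps:
        children[dep].append(child)

def stale_after(changed):
    # Everything that needs re-checking once `changed` is touched.
    out, stack = set(), [changed]
    while stack:
        for c in children[stack.pop()]:
            if c not in out:
                out.add(c)
                stack.append(c)
    return out

print(sorted(stale_after("mass_props_v2")))
# the tensor, the plot, and both documents that embed it
```

When the spar is reinforced and the mass properties move, the plot and both documents carrying it show up in one walk - which is what makes “did anyone notice” into a notification.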
Thread is the topology instinct given proper tooling, and structurally the same complaint as the PowerPoint deck refusal: data has structure, and the format you’re forcing it through doesn’t. The difference is twenty years of catching up. Refuse the flattening. Store the topology. Let the user query the actual shape of their work. Stop making humans re-enter structure the system should have kept.
The thread
One thing, twenty years, one sentence.
A short note before the close. The reason I could finally see all of this as one thing rather than as twenty separate annoyances is that I got an ADHD diagnosis and started medication a few months ago. I’ll write about that properly in the next post — there’s more to say than fits here. The relevant bit for this article is small: the diagnosis made the through-line legible, and the meds made it easier to finish the project rather than abandon it at the same point I always did before. The instinct didn’t change. The visibility did.
So.
I used to think these were separate stories. Getting annoyed at filling in my name and phone number on every visit to the GP, the PowerPoint decks, Paradigm, the database migrations, Thread. They sat in different folders in my head — one for engineering, one for software, one for personal grievance — and I treated them like unrelated things*I use the word ‘thing’ a lot, and I like it - even if it sounds inelegant. I happened to have opinions about.
They are one thing, really.
The thing is not “I like topology,” though eventually that became the technical language. The thing is simpler and more embarrassing: I don’t like data entry.
Not because I’m above it. Because data entry is usually a symptom. It means the system had structure, lost it somewhere, and handed the reconstruction job to a person. The PowerPoint deck flattens a graph and asks the audience to imagine the connections back. The paper form ignores a database and asks the new patient to retype it. The Cascade-shaped fleet model and the airport-shaped infrastructure model sit in different rooms and ask a strategist to hold the coupling in their head. The CSV strips the provenance and asks an engineer to reconstruct it from filenames three months later. Different systems, same move. Drop the structure. Make a human put it back, if it’s done at all.
Thread is the latest and most literal version of that refusal. Store the shape. Preserve the provenance. Let the flat view be a view, not the truth. Stop making humans re-enter structure the system should have kept.
I don’t think this is a particularly profound observation. It is, if anything, the opposite — most of the people I admire technically have an instinctive version of the same complaint, and the tooling response (databases, type systems, dependency graphs, build tools, even the version-control history of a file) is largely the same response across very different fields. The bit that took me two decades wasn’t having the instinct. It was finding the right place to put it down. Plot dashboards were a place to put it down for a project. Paradigm was a place to put it down for a year. Postgres was a place to put it down with decent structure. Thread is, I think, the right place to put it down for the work I want to spend the next chunk of my career on.
That is the thread. It was there the whole time, and I’d love to talk to you about it.