Replication


❝   And that’s why mistakes had to be corrected. BASF fully recognized that Ostwald would be annoyed by criticism of his work. But they couldn’t tiptoe around it, because they were trying to make ammonia from water and air. If Ostwald’s work couldn’t help them do that, then they couldn’t get into the fertilizer and explosives business. They couldn’t make bread from air. And they couldn’t pay Ostwald royalties. If the work wasn’t right, it was useless to everyone, including Ostwald.

From Paul von Hippel, "[When does science self-correct?](https://goodscience.substack.com/p/when-does-science-self-correct-lessons)". And when not.

Intelligible Failure

Adam Russell created the DARPA SCORE replication project. Here he reflects on the importance of Intelligible Failure.

[Advanced Research Projects Agencies] need intelligible failure to learn from the bets they take. And that means evaluating risks taken (or not) and understanding—not merely observing—failures achieved, which requires both brains and guts. That brings me back to the hardest problem in making failure intelligible: ourselves. Perhaps the neologism we really need going forward is for intelligible failure itself—to distinguish it, as a virtue, from the kind of failure that we never want to celebrate: the unintelligible failure, immeasurable, born of sloppiness, carelessness, expediency, low standards, or incompetence, with no way to know how or even if it contributed to real progress.

Do your own research

Sabine Hossenfelder’s Do your own research… but do it right is an excellent guide to critical thinking and a helpful antidote to the meme that no one should “do your own research”.

  1. (When not to do you own research.)
  2. Prepare well:
    • Reasonable expections: what can you reasonably learn in hours of online/library work?
    • Which specific questions are you trying to answer?
    • Be honest with yourself: about your biases and about what you don’t understand, or aren’t understanding as you read.
  3. Start with basics: Begin wtih peer-reviewed review articles, reports, lectures, & textbooks. Then look at recent publications. Use Google Scholar and related services to track citations to your source. Check for predatory journals. Beware preprints and conference proceedings, unless you can consult an expert.
  4. No cherry-picking! [ Even though you probably started because someone is wrong on the internet. -crt] This is the #1 mistake of “do your own research”.
  5. Track down sources
    • Never trust 2nd hand sources. Look at them to get started, but don’t end there.
    • If data is available, favor that over the text. Abstracts and conclusions especially tend to overstate.

Alan Jacobs' Two versions of covid skepticism summarizes a longer piece by Madeleine Kearns. Both are worth reading.

To his quotes I’ll add the core folly Kearns charges both the Covidians and the Skeptics with:


❝   by making problems that are in essence forever with us seem like a unique historical rupture.

...much exaggerated.

Headlines about the death of theory are philosopher clickbait. Fortunately Laura Spinney’s article is more self-aware than the headline:


❝   But Anderson’s [2008] prediction of the end of theory looks to have been premature – or maybe his thesis was itself an oversimplification. There are several reasons why theory refuses to die, despite the successes of such theory-free prediction engines as Facebook and AlphaFold. All are illuminating, because they force us to ask: what’s the best way to acquire knowledge and where does science go from here?

(Note: Laura Spinney also wrote Pale Rider, a history of the 1918 flu.)

Big Data?

Forget Facebook for a moment. Image classification is the undisputed success of black-box AI: we don’t know how to write a program to recognize cats, but we can train a neural net on lots of picture and “automate the ineffable”.

But we’ve had theory-less ways to recognize cat images for millions of years. Heck, we have recorded images of cats from thousands of years ago. Automating the ineffable, in Kozyrkov’s lovely phrase, is unspeakably cool, but it has no bearing on the death of theory. It just lets machines do theory-free what we’ve been doing theory-free already.

Mis-understanding

The problem with black boxes is supposedly that we don’t understand what they’re doing. Hence DARPA’s “Third Wave” of Explainable AI. Kozyrkov thinks testing is better than explaining - after all we trust humans and they can’t explain what they’re doing.

I’m more with DARPA than Kozyrkov here: explainable is important because it tells us how to anticipate failure. We trust inexplicable humans because we basically understand their failure modes. We’re limited, but not fragile.

But theory doesn’t mean understanding anyway. That cat got out of the bag with quantum mechanics. Ahem.

Apparently the whole of quantum theory follows from startlingly simple assumptions about information. That makes for a fascinating new Argument from Design, with the twist that the universe was designed for non-humans, because humans neither grasp the theory nor the world it describes. Most of us don’t understand quantum. Well maybe Feynman, though even he suggested he might not really understand.

Though Feynman and others seem happy to be instrumentalist about theory. Maybe derivability is enough. It is a kind of understanding, and we might grant that to quantum.

But then why not grant it to black-box AI? Just because the final thing is a pile of linear algebra rather than a few differential equations?

Theory-free?

I think it was Wheeler or Penrose – one of those types anyway – who imagined we met clearly advanced aliens who also seemed to have answered most of our open mathematical questions.

And then imagined our disappointment when we discovered that their highly practical proofs amounted to using fast computers to show they held for all numbers tried so far. However large that bound was, we should be rightly disappointed by their lack of ambition and rigor.

Theory-free is science-free. A colleague (Richard de Rozario) opined that “theory-free science” is a category error. It confuses science with prediction, when science is also the framework where we test predictions, and the error-correction system for generating theories.

Back to the article

Three examples from the article:

Machines can predict better than professionals.

Certainly. Since the 1970s when Meehl showed that simple linear regressions could outpredict psychiatrists, clinicians, and other professionals. In later work he showed they could do that even if the parameters were random.

So beating these humans isn’t prediction trumping theory. It’s just showing disciplines with really bad theory.

Prospecting Gaps

I admire Tom Griffiths, and any work he does. He’s one of the top cognitive scientists around, and using neural nets to probe the gaps in prospect theory is clever; whether it yield epicycles or breakthroughs it should advance the field.

He’s right that more data means you can support more epicycles. But basic insights Wallace’s MML remain: if the sum of your theory + data is not smaller than the data, you don’t have an explanation.

Regularization

AlphaFold’s jumping-off point was the ability of human gamers to out-fold traditional models. The gamers intuitively discovered patterns – though they couldn’t fully articulate them. So this was just another case of automating the ineffable.

But the deep nets that do this are still fragile – they fail in surprising ways that humans don’t, and they are subject to bizarre hacks, because their ineffable theory just isn’t strong enough. Not yet anyway.

So we see that while half of success of Deep Nets is Moore’s law and Thank God for Gamers, the other half is tricks to regularize the model.

That is, to reduce its flexibility.

I daresay, to push it towards theory.

Power

Screenshot of Robert Links three-tweet thread noting the study claiming no extra vax benefit to the previously infected had a confidence interval of 0..infinity. Meaning it provided no information. Clever study idea, but seriously underpowered.

Watching Sydney’s Delta cases repeat the early-phase exponential growth of Melbourne, ADSEI’s Linda McIver asks:

Would our collective understanding of covid have been different if we were all more data literate?

Almost certainly, and I’m all for it. But would that avoid

watching Sydney try all of the “can we avoid really seriously locking down” strategies that we know failed us, … like a cinema audience shouting at the screen,

Not necessarily. Probably not, even, but that’s OK. It would still be a huge step forward to acknowledge the data and decide based on costs, values, and uncertainties. I’m fine with Sydney hypothetically saying,


❝   You're right, it's likely exponential, but we can't justify full lockdown until we hit Melbourne's peak.

I might be more (or less) cautious. I might care more (or less) about the various tradeoffs. I might make a better (or worse) decision were I in charge. That’s Okay. Even with perfect information, values differ.

It’s even fine to be skeptical of data that doesn’t fit my preferred theory. Sometimes Einstein’s right and the data is wrong.

What’s not okay is denying or ignoring the data just because I don’t like the cost of the implied action. Or, funding decades-long FUD campaigns for the same reason.


PS: Here is Linda’s shout suggesting that (only) stage-4 lockdown suppressed Delta: Melbournes second wave with dates of restrictions

Arnold Kling: 2021 book titles show epistemological crisis.

This may fit with a historical pattern. The barbarians sack the city, and the carriers of the dying culture repair to their basements to write.

Though some of this goes back at least to Paul Meehl.

HTT: Bob Horn. Again.

In a long post on sustained irrationality in the markets, Vitalik Buterin describes his experience wading into to 2020 election prediction markets:

I decided to make an experiment on the blockchain that I helped to create: I bought $2,000 worth of NTRUMP (tokens that pay $1 if Trump loses) on Augur. Little did I know then that my position would eventually increase to $308,249, earning me a profit of over $56,803, and that I would make all of these remaining bets, against willing counterparties, after Trump had already lost the election. What would transpire over the next two months would prove to be a fascinating case study in social psychology, expertise, arbitrage, and the limits of market efficiency, with important ramifications to anyone who is deeply interested in the possibilities of economic institution design.

There’s a skippable technical section. His take-home is that intellectual underconfidence is a big part of why these markets can stay so wrong for so long.

But nevertheless it seems to me more true than ever that, as goes the famous Yeats quote, “the best lack all conviction, while the worst are full of passionate intensity.”

Airborne / VAERS

Thanks to Mike Bishop for alerting me to Jiminez' 100-tweet thread and Lancet paper on the case for COVID-19 aerosols, and the fascinating 100-year history that still shapes debate.

Because of that history, it seems admitting “airborne” or “aerosol” has been quite a sea change. Some of this is important - “droplets” are supposed to drop, while aerosols remain airborne and so circulate farther.

But some seems definitional - a large enough aerosol is hindered by masks, and a small droplet doesn’t drop.

However, point being that like measles and other respiratory viruses, “miasma” isn’t a bad concept, so contagion can travel, esp. indoors.

VAERS Caveat

Please people, if using VAERS, go check the details. @RealJoeSmalley posts stuff like “9 child deaths in nearly 4,000 vaccinations”, but it’s not his responsibility if the data is wrong, caveat emptor.

With VAERS that’s highly irresponsible - you can’t even use VAERS without reading about its limits.

I get 9 deaths in VAERS if I set the limits to “<18”. But the number of total US vaccinations for <18 isn’t 4,000 - it’s 2.2M.

Also I checked the 9 VAERS deaths for <18:

Two are concerning because little/no risk:

  1. 16yo, only risk factor oral contraceptives
  2. 15yo, no known risks

Two+ are concerning but seem experimental. AFAIK the vaccines are not approved for breastfeeding, and are only in clinical trial for young children. Don’t try this at home:

  1. 5mo breastmilk exposure - mom vaccinated. (?!)
  2. 2yo in ¿illicit? trial? Very odd report saying it was a clinical trial but the doctors would deny that, reporter is untraceable, batch info is untraceable. Odd.
  3. 1yo, seizure. (Clinical trial? Else how vaccinated?)

Two were very high risk patients. (Why was this even done?):

  1. 15yo with about 25 severe pre-existing / allergies
  2. 17yo w/~12 severe pre-existing / allergies

Two are clearly unrelated:

  1. Error - gunshot suicide found by family, but age typed as “1.08”.
  2. 17yo, firearm suicide - history of mental illness

For evaluating your risk, only the two teens would seem relevant. They might not be vaccine-related, but with otherwise no known risk, it’s a very good candidate cause.

VAERS Query

I’m not able to get “saved search” to work, so here’s the non-default Query Criteria:

  • Age: < 6 months; 6-11 months; 1-2 years; 3-5 years; 6-17 years
  • Event Category: Death
  • Serious: Yes
  • Vaccine Products: COVID19 VACCINE (COVID19)

Group By: VAERS ID

Ben Kuo saved a life using Google Earth and clear thinking.

www.washingtonpost.com/lifestyle…

Brand defense; vendoring culture; memory & notes

In biographies and brands, after some clear examples Jacobs notes that many accusations of “factual errors” are really brand defenses:

When they said that Jacobs makes many factual errors, they weren’t even really making a truth claim, they were uttering a spell to ward off the stranger. They were placing me outside their Inner Ring.

~ ~ ~

In vendoring culture, I’m fascinated by the parallels he draws between vendoring code, and what Gene Luen Yang has done by incorporating DC Comics' earliest racist caricature into a new comic:

What Yang has done is moral repair through vendoring code &em; in this case ... cultural code. And note that Yang has not ... simply pointed to code created and maintained by someone else. ...he could only correct it by making it his own.

Had I read both, I still wouldn’t have made that connection. Spark.

~ ~ ~

MacWright:

Frictionless note-taking produces notes, but it doesn’t - for me - produce memory.

COVID19 Origins Debate | Metaforecast

HTT Nuño Sempere’s January forecasting newsletter. And be sure to check out his marvelous Metaforecast service!

~ ~ ~

So: Metacululs and RootClaim give very different probabilities that COVID-19 originated in a lab (see earlier post summarizing Monk):

  • Metaculus: 15% for Hubei lab origins (either accidental or deliberate) - median never went above 30% during the last year.
  • RootClaim: 76% accidental release, and 2% deliberate, based on a (simplified?) Bayesian analysis.

Metaculus has ~3K forecasts on that question over the last year+, and over 260 comments, most well-informed. They’ve done well in COVID-19 forecasts vs. experts. (And famously one of their top forecasters nailed the pandemic in late January 2020, as Sempere reminds us.)

Rootclaim, as far as I can tell, begins with some crowdsourcing to formulate hypotheses, get initial probabilities, gather sources, and maybe to help set likelihoods. Then they do a Bayesian update. At one point they used full Bayesian networks. It seems this one treats each evidence-group as independent.

Both are heavily rationalist and Bayesian-friendly, and had access to each other’s forecasts. So the divergence is quite interesting - I wish I had time to dig into it some more.

AJ on Conservatism Inc., +more

Worth reading in entirety, Alan Jacobs reflects on an essay by Douthat:

Such a system, predictably, was terrible at generating the kind of outward-facing, evangelistic conservatives who had made the Reagan revolution possible.

He ends by saying Antonin Scalia was the sole survivor of the Old Republic, making Amy Coney Barret the Last Jedi.

(OK, Douthat doesn’t use Star Wars. But Sunstein did. I just blended them.)


Where Douthat muses on Trump’s loss, Ian Leslie reflected on Biden’s victory over both Sanders and Trump:

It's worth spending a bit of time on what it means to be moderate.
Three forgotten principles of moderate politics that sparkle because they are both obvious and ignored.

Turns out Leslie has a new book, Conflicted:

Disagreement is the best way of thinking we have. It weeds out weak arguments, improves decision-making, leads to new ideas, and, counter-intuitively, brings us closer to one another. But only if we do it well - and right now, we’re doing it terribly.

Looking forward to it. Mercier & Sperber demonstrated that disagreement is how groups think, and that under the right conditions, they vastly outthink people.


For some gentle advice on How to Think, try Alan Jacobs' book by that name. We all cover the philosophy & cognitive science. Jacobs tackles the human component:

[We describe] argument as war ... because in many arguments there truly is something to be lost, and most often what's under threat is social affiliation. [My bold.]

What to do?

6. Gravitate, as best you can, in every way you can, toward people who seem to value genuine community and can handle disagreement with equanimity.

Gelman on Bad Science for Good

Gelman’s recent short post on Relevance of Bad Science for Good Science includes a handy Top10 junk list:

A Ted talkin’ sleep researcher misrepresenting the literature or just plain making things up; a controversial sociologist drawing sexist conclusions from surveys of N=3000 where N=300,000 would be needed; a disgraced primatologist who wouldn’t share his data; a celebrity researcher in eating behavior who published purportedly empirical papers corresponding to no possible empirical data; an Excel error that may have influenced national economic policy; an iffy study that claimed to find that North Korea was more democratic than North Carolina; a claim, unsupported by data, that subliminal smiley faces could massively shift attitudes on immigration; various noise-shuffling statistical methods that just won’t go away—all of these, and more, represent different extremes of junk science.

And the following sobering reminder why we study failures:

None of us do all these things, and many of us try to do none of these things—but I think that most of us do some of these things much of the time.

US Insurrectionist Movement?

January 6 moved Randy Pherson at Globalytica to ask if there is an active insurrectionist movement in the US.

Before clicking, at least quickly decide whether you would rate these as High, Medium, or Low (image from his post):

Indicators of Insurrection

Then click to see ratings from a dozen of Pherson’s colleagues - probably professional or retired analysts.

What is your theory, again?

Just re-found this @ayjay essay in an old tab.

The question I would ask churches that are re-opening without masks or distancing, but with lots of congregational singing, is: How do you think infectious disease works, exactly? How do you think COVID–19 is transmitted? What’s the theory you’re operating on?

I still know people using an incoherent mix of, well, all of these:

  • There is no real pandemic.
  • It’s a Chinese bio-weapon.
  • Masks (etc.) don’t work.
  • There’s easy and effective treatments.

My People

Screen Shot 2021 02 16 at 10 17 33 PM UntitledImage

Sam Rocha, In America Magazine

Ritchie on Sloppy Pandemic Science

Essay worth reading in its entirety: The Great Reinforcer by Stuart Ritchie.

To be sure, out of the gloom of the pandemic came some incredible advances – the stunning progress made on vaccines chief among them. But these bright spots were something of an exception. For those of us with an interest in where science can go wrong, the pandemic has been the Great Reinforcer: it has underlined, in the brightest possible ink, all the problems we knew we had with the way we practice science.

Acknowledging stunning successes in the science of COVID-19, he reviews our regrettable and predictable failures. And hitting a little too close for comfort, notes how much harm comes from a desire to help.

Monk on China Conspiracy Theories - April 2020

I was reviewing this old interview with Australia’s Paul Monk, covering Coronavirus, China, conspiracy theories, reckoning, and risk to Pax Americana, as they looked to him in late April.

If nothing else, listen to him open by quoting Thucydides.

tl;dr Monk reviews key plagues from Athens to now, discusses the situation in April, and assesses three different China-did-it theories, and closes by arguing that the West must reckon with China, and it’s behavior during COV-2 has been a wake-up call.

I’ve had the pleasure to work briefly with Paul on argument mapping and critical thinking, and this is a good example of weighing plausibility and evidence.

It’s probably better to listen or read the interview, but here’s my summary of key points.

Plagues

I’ve summarized Paul’s summary in a table, filling in numbers from Wikipedia or Ancient.edu (example). It seems the Roman plagues are hard to estimate because they lasted so long.

Plague When Where # Dead % Pop
Athenian 4th C BC Athens ~100K ~1/4 - 1/3
Antonine 2nd C Roman ~5M ~1/3
Justinian 5th C Roman & Persian ~25 - 50M ~1/4 - 1/3
Black 14th C Eurasia+ ~75 - 200M ~1/4 - 1/3
Smallpox 16th C Americas ~60M ~95%
Great 17th C London ~100K ~1/4
Spanish Flu 20th C Global ~50-200M ~2-3%, but young

Conspiracy Theories

Monk does a nice job separating and assessing the China theories for the origin of SARS-COV-2. As of April, but it seems a solid assessment.

First, there’s the utterly mad theories: 5G and control chips. The China theories aren’t like that. That’s key.

  1. Unrestricted War: deliberate leak. This gains plausibility from the 1997 book Unrestricted War by two Chinese colonels who suggested this possibility. So, it’s a viable theory - and important to note it was sensible for people to draw the connection. So, two avenues of assessment:

    a. Sanity Check: Why would the Party agree, given grave risks? (a) How could they guarantee they could control it in China? (b) How could they be confident the outside world wouldn’t figure out whodunnit and “there would be hell to pay”? (c) To control in China you’d have to shut down the economy. Why? When you’re trying to dominate economically? Okay, so it would be hugely risky.

    b. Evidence: None. [In the meantime we learned a bit more about how sus' they were acting, but Monk was pretty clear on that already. And their actions are pretty likely on the other China theories too. OK, what about biology? A Taiwanese dissident published a paper saying it was a deliberate release … but her claims for that fell apart. Her publishing circumstances were also dodgy. -And as I try to remind people, evidence is a ratio: to count for this theory, it has to be more likely on this theory than on its contenders. -ed]

    c. Meta Evidence: some serious right-wing analysts looked at and dismissed this claim. You’d expect them to jump on it if they could. The intelligence communities in AU, NZ, CA, UK, and US all concluded this looks unlikely. Again, being their key job, you’d expect them to jump on it if they could - at least confidentially. But there’s no evidence in leaks or actions that this is considered remotely plausible.

  2. Unintentional Escape A - It came from Wuhan or another lab, but the usual sort of leak. (Hi, I live in Reston. Ebola anyone?) In this variant, the party simply doesn’t know how it got out, takes awhile to figure it out, and when they do realize “Wow, this is serious”, they say so. If the Party had a history of acting responsibly, this might be more plausible. It doesn’t.

  3. Unintentional Escape: B - As before, but once it leaks they think, “This looks bad” and conduct a propaganda campaign to suppress & whitewash. Evidence: they’re definitely conducting a propaganda campaign, including failed attempts to bully Australia and the rest of the WHO into not investigating.

Reckoning

Yes trading with China helped them prosper. Yes, that was good for us too. Yes, it’s middle class grew, and wanted more liberty. No, the Communist Party did not become more tractable. There was a glimmer of hope, and it was quashed.

Monk says it’s time to get consensus of Western & African countries, and tell China, “This cannot work. It can’t work for us and because it can’t work for us, you’re going to realize it cannot work for you. …there’s no other choice."

Risk of Nationalism

1918 and WWI led to the Great Depression, which ushered in reactionary politics around the world: Germany, Japan, Italy, El Salvador. We’re seeing signs now, and that threatens the Pax American that made possible “the greatest expansion in average human wellbeing across the planet … over the last [30-70] years ever in history.” To the extent that other countries follow China to say “Our interests are paramount … then we’re back in the 1930s.”

Silver Linings

Paul thinks people have become more reflective, and the pandemic has highlighted the need for global solutions and clear thinking. I suspect it’s just Paul.

He has a nice analogy to The Martian though.

A touching, real conversation on faith, death, grief, culture wars, the Christmas season, comparative immigration, and briefly, Thatcher on climate. Underneath, kinship.

api.substack.com

Another great piece at Fantastic Anachronism: Are Experts Real.

I hadn’t heard about N=59. 😱

…if the N=59 crew are making such ridiculous errors in their own papers, they obviously don’t have the ability to judge other people’s…

Richard Hanania:

A remarkably high percentage of bad economic takes can be attributed to people being unable to realize the world is not zero-sum.