Tag: llm

48 posts

We Are Starting to Sound Like the Thing We Built

“Stay on the road. Keep clear of the moors.“

·
Jul 5, 2026

HODLing gear in 2026


imouthes icon
imouthes
imouthes.offprint.app
·
Jul 4, 2026

A Month Without Frontier Models


Randoneering Blog icon
Randoneering Blog
randoneering.offprint.app
·
Jul 1, 2026
スマホのLLM

スマホのLLM

ZTE nubia LLMの場合


ドール
mochott.site/dot3pso.bsky.social
·
Jul 1, 2026

On AI Religion

·
Jun 29, 2026
300baud modem or first broadband?

300baud modem or first broadband?

At what stage of AI model evolution are we at?

·
Jun 28, 2026

Ineffizient

Mit Effizienz misst man die Ressourcennutzung – also das Verhältnis von Ergebnis zu eingesetzten Mitteln (Zeit, Geld, Material, Personal). Der Ressourcenverbrauch von LLMs ist enorm und mir erscheint es schon länger so, als ob man so etwas wie ein Auto erfunden hätte, aber dummerweise mit einem Benzinverbrauch von 10 Litern pro Kilometer. Solange ich das...

·
Jun 27, 2026
LLMs Just "Fake it `til they make it"

LLMs Just "Fake it `til they make it"

This is what happened when I tried to make a Language Model mimic my knowledge and behaviour for a chat-bot on my portfolio site.


Anthony Cregan Portfolio icon
Anthony Cregan Portfolio
anthonycregan.co.uk
·
Jun 25, 2026

What Is a Harness?

LLMs are engines. Harnesses are everything else—the wheels, brakes, dashboard, GPS—that turn a raw engine into a useful vehicle. First in a series on harnesses for the open knowledge commons.


lu.is icon
lu.is
lu.is
·
Jun 22, 2026

I built a snark detector and pointed it at myself


J
Justin-Stanley.com
justin-stanley.com
·
Jun 21, 2026

Toward Automated Discourse Network Analysis

Discourse Network Analysis has long been limited by the price of expert judgment. Here is a design for automating it at corpus scale without surrendering command of meaning — and FineStructure, the open-source workbench I am building for it.


A
Activation Layer
activationlayer.org
·
Jun 14, 2026

New Skills

Coding Skills in the Age of LLMs


V
vibecode.rodeo
vibecode.rodeo
·
Jun 14, 2026
Language integrated LLMs as an OCaml function

Language integrated LLMs as an OCaml function

Using a local DeepSeek model as an ordinary OCaml library and building sandboxed agents from simple primitives

·
Jun 13, 2026

OpenSearch Semantic Search

My learnings on OpenSearch semantic searching


Chris Parsons icon
Chris Parsons
chrisparsons.dev
·
Jun 10, 2026

OpenSearch Semantic Search

My learnings on OpenSearch semantic searching


Chris Parsons icon
Chris Parsons
chrisparsons.dev
·
Jun 10, 2026
Treating LLMs as programming books

Treating LLMs as programming books

Thoughts on an approach for using LLMs effectively for coding without losing engagement and cognitive effort.


jola.dev icon
jola.dev
jola.dev
·
Jun 8, 2026
Treating LLMs as programming books

Treating LLMs as programming books

Thoughts on an approach for using LLMs effectively for coding without losing engagement and cognitive effort.


jola.dev icon
jola.dev
jola.dev
·
Jun 8, 2026
Det här med AI eller LLM

Det här med AI eller LLM

Det här med AI eller stora språkmodeller (LLM) som det egentligen handlar om är en fråga med en mängd aspekter och synsätt. På det sociala medium, Mastodon, som jag främst använder är de flesta väldigt negativa till Artificiell intelligens (AI).


S
Svenssons Nyheter
blog.zaramis.se/
·
Jun 7, 2026

yallmap — Yet Another LLM Proxy

Building an Anthropic-native LLM gateway in TypeScript


G
Grokkist
grokkist.com
·
Jun 4, 2026
The social contract of writing

The social contract of writing

About the value of genuine writing in a world being drowned in slop.


jola.dev icon
jola.dev
jola.dev
·
May 24, 2026
The social contract of writing

The social contract of writing

About the value of genuine writing in a world being drowned in slop.


jola.dev icon
jola.dev
jola.dev
·
May 24, 2026
Wie kann ChatGPT zur Verbesserung eines Magento Onlineshops verwendet werden?

Wie kann ChatGPT zur Verbesserung eines Magento Onlineshops verwendet werden?

Dieser Beitrag soll ein paar Anregungen geben, wie man als Shopbetreiber seinen Shop mit ChatGTP verbessern kann. Wozu kann ChatGPT oder ein andere KI Client genutzt werden für Shopverbesserungen? KI Chatclient kann meiner Meinung nach sehr gut als Sparing Partner und Ideengeber liefern. Die KI kann riesige Mengen an Informationen für einen Verarbeiten ohne das...

·
May 21, 2026
e554 — SPI vs I

e554 — SPI vs I

e554 - SPI vs I: Stories and discussion on #LLM #PhoneNumber lookups, #proctors returning to #Princeton, lavish #LEGO, #LOTR and a whole lot more!


Games At Work dot Biz icon
Games At Work dot Biz
gamesatwork.biz/
·
May 18, 2026

Tacit: An Experimental LLM-First Programming Language

·
May 17, 2026

Three circles for thinking about LLMs

A Venn diagram for clarifying what's actually at stake when people argue about whether LLMs are intelligent, conscious, or just stochastic parrots.


B
benswift.me
benswift.me
·
May 11, 2026
Running local models on an M4 with 24GB memory

Running local models on an M4 with 24GB memory

Experiments with getting usable outputs out of local models on a standard Macbook


jola.dev icon
jola.dev
jola.dev
·
May 9, 2026
Running local models on an M4 with 24GB memory

Running local models on an M4 with 24GB memory

Experiments with getting usable outputs out of local models on a standard Macbook


jola.dev icon
jola.dev
jola.dev
·
May 9, 2026

Termitennae III: FEEDFOREWORD

Chapter 3

·
May 9, 2026

Collaboration: a Confused Story in Five Graphs

Some graphs about reading and writing on the internet. Less a story than a Rorschach test.


lu.is icon
lu.is
lu.is
·
Apr 30, 2026

Doing what we can where we can

·
Apr 24, 2026
Le Chat – Eine europäische Alternative zu Copilot, ChatGPT und Claude? (Werbung – unbeauftragt, unbezahlt)

Le Chat – Eine europäische Alternative zu Copilot, ChatGPT und Claude? (Werbung – unbeauftragt, unbezahlt)

Werbung – unbeauftragt & unbezahlt:Aufgrund von Markennennung, Produktdarstellung und Verlinkungen handelt es sich um Werbung, auch wenn ich das Produkt selbst gekauft habe und keine Kooperation besteht. Le Chat (frz. die Katze) ist als Katzenhalter und -liebhaber doch ein sehr ansprechender Name für eine künstliche Intelligenz. Bekannt war mir die KI auch bereits schon länger,...


Bunte Küchenabenteuer icon
Bunte Küchenabenteuer
bunte-kuechenabenteuer.de/
·
Apr 23, 2026

Wikipedia's traffic drop: more on languages and freshness

Following up on last week's post, I looked at 5,000 "Vital Articles" across eight major-language Wikipedias. Articles about math, physical sciences and tech are waaaay down, while people, geography, and history hold up far better—regardless of which language they're in. Article freshness matters too—but not as much.


lu.is icon
lu.is
lu.is
·
Apr 23, 2026
Magento Onlineshop Wiederkäufe durch KI/LLMs steigern

Magento Onlineshop Wiederkäufe durch KI/LLMs steigern

In diesem Beitrag soll einmal auf theoretischem Level erläutert werden welche Daten in einem Magento Onlineshop vorhanden sind und wie diese über eine KI-Anbindung "gehoben" werden können. Der Fokus liegt in diesem Beitrag auf den Wiederkäufe. Somit Stammkundschaft stärken. Welche Daten gibt es im Onlineshop? Bestelldaten Der Shop erfasst logischerweise die Bestellungen. Diese Bestellungen sind...

·
Apr 21, 2026

Career articles on Wikipedia: some scary numbers

I took a look at English Wikipedia pageviews for ~4,000 articles about careers. The numbers are grim: the median is down 28% from pre-COVID, with a huge drop in the last year.


lu.is icon
lu.is
lu.is
·
Apr 19, 2026

Same Agent, Different Score: The Problem With Testing Non-Deterministic AI

Before building tools for my Zork-playing agents, I needed a benchmark I could trust. I ran five local models through fifty playthroughs and discovered that the same model can score 40 or 0 on the same game. Getting honest numbers required three harness versions, structured telemetry, and a loop detector that learned the difference between stuck and thorough.

·
Apr 16, 2026
How to hit your Claude weekly limit so you can go outside and touch grass

How to hit your Claude weekly limit so you can go outside and touch grass

A satirical guide to maxing out your Claude weekly limit so you finally go outside and touch grass, featuring sub-agents, MCPs, and max effort.


jola.dev icon
jola.dev
jola.dev
·
Apr 12, 2026
How to hit your Claude weekly limit so you can go outside and touch grass

How to hit your Claude weekly limit so you can go outside and touch grass

A satirical guide to maxing out your Claude weekly limit so you finally go outside and touch grass, featuring sub-agents, MCPs, and max effort.


jola.dev icon
jola.dev
jola.dev
·
Apr 12, 2026

The Liebox That Wears Our Words

A call to hold the line against Claudeswallop.

·
Apr 9, 2026

Stuck in the Maze: Why AI Agents Can't Hold the Map

I had local AI models play Zork, the 1981 text adventure, to study why agents struggle to navigate connected systems. One started responding in Thai. Most scored zero. All got hopelessly stuck in the maze. What broke says a lot about why agents get lost in microservices too.

·
Apr 6, 2026

The machines are not, in fact, fine.

"Claude, write me a viral debut blog post using these three sources, make no mistakes."

·
Apr 5, 2026

Process Sandboxing & Agents (in 5min)

Linux sandboxing for the win.

·
Apr 1, 2026
Specs to rule them all?

Specs to rule them all?

With LLMs being good enough to generate code, the implementation cost is dropping drastically. And a trident is emerging which can be an attempt to harness how aligned agents are to real intent. Is Spec Driven Development here to stay?

·
Mar 26, 2026

Local first Fill-in-the-Middle (FIM) with llama.cpp

·
Mar 25, 2026

The model vs the harness

"What model does it use?" is the wrong question. Most of the differences you feel between AI tools come from the harness, not the model.


J
Jacob Bennett
jacob.blog
·
Mar 15, 2026
Stay in the Loop: How I Actually Use Claude Code

Stay in the Loop: How I Actually Use Claude Code

How to multi-task Claude Code while staying in the loop, increasing success rate and parallelization.


jola.dev icon
jola.dev
jola.dev
·
Mar 6, 2026
Connecting the dots for biodiversity action from the NAS/Royal Society Forum

Connecting the dots for biodiversity action from the NAS/Royal Society Forum

Summary of the Nine Recommendations and Biodiversity Monitoring Standards Framework papers from the NAS/Royal Society US-UK Forum in summer 2025, and how they connect to my work on collective knowledge systems, TESSERA, and evidence synthesis.

·
Mar 6, 2026
Stay in the Loop: How I Actually Use Claude Code

Stay in the Loop: How I Actually Use Claude Code

How to multi-task Claude Code while staying in the loop, increasing success rate and parallelization.


jola.dev icon
jola.dev
jola.dev
·
Mar 6, 2026

Lawyers, Humility, and LLMs

Reviewing a book about a multi-billion-dollar contract bug—and what it means for the profession's arrogant response to LLMs.


lu.is icon
lu.is
lu.is
·
Mar 4, 2026