From training to inference: The new role of web data in LLMs
Data has always been key to LLM success, but it's becoming key to inference-time performance as well.

Community-generated articles submitted for your reading pleasure.
Want to train a specialized LLM on your own data? The easiest way to do this is with low-rank adaptation (LoRA), but many variants of LoRA exist.
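For a taste of the core idea, here's a minimal LoRA-style layer in PyTorch (a sketch under assumed shapes and hyperparameters, not code from the article): the pretrained weight is frozen, and only a small low-rank update is trained.

```python
import torch
import torch.nn as nn

class LoRALinear(nn.Module):
    """Wrap a pretrained nn.Linear with a trainable low-rank update (sketch)."""

    def __init__(self, base: nn.Linear, r: int = 8, alpha: float = 16.0):
        super().__init__()
        self.base = base
        for p in self.base.parameters():
            p.requires_grad_(False)  # freeze the pretrained weights
        # Low-rank factors: B starts at zero, so training begins at the base model.
        self.A = nn.Parameter(torch.randn(r, base.in_features) * 0.01)
        self.B = nn.Parameter(torch.zeros(base.out_features, r))
        self.scale = alpha / r

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # Effective weight is W + scale * (B @ A), applied without materializing it.
        return self.base(x) + self.scale * (x @ self.A.T @ self.B.T)
```

Swapping a layer like this into a model and training only A and B is the basic recipe; the variants mostly differ in where the update is applied and how it is scaled.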
Is anyone designing software where failures don't have consequences?
It’s easy to generate code, but not so easy to generate good code.
A developer’s journal is a place to define the problem you’re solving and record what you tried and what worked.
Would updating a tool few think about make a diff(erence)?
Wondering how to go about creating an LLM that understands your custom data? Start here.
Masked self-attention is the key building block that allows LLMs to learn rich relationships and patterns between the words of a sentence. Let’s build it together from scratch.
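To give a flavor of the build, here's a single-head sketch in PyTorch (assuming a 2-D input of shape [seq, d]; real implementations add batching, multiple heads, dropout, and an output projection):

```python
import math
import torch
import torch.nn.functional as F

def masked_self_attention(x, w_q, w_k, w_v):
    """Single-head causal self-attention over a [seq, d] input (sketch)."""
    q, k, v = x @ w_q, x @ w_k, x @ w_v
    scores = (q @ k.T) / math.sqrt(k.shape[-1])
    # Causal mask: position i may only attend to positions 0..i.
    future = torch.triu(torch.ones_like(scores, dtype=torch.bool), diagonal=1)
    scores = scores.masked_fill(future, float("-inf"))
    return F.softmax(scores, dim=-1) @ v
```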
It’s tempting to push projects out the door to woo and impress colleagues and supervisors, but the stark truth is that even the smallest projects should have proper review periods.
In today's data-driven world, Apache Kafka has emerged as a cornerstone of modern data streaming, particularly with the rise of AI and the immense volumes of data it generates.
The decoder-only transformer architecture is one of the most fundamental ideas in AI research.
Retrieval-augmented generation (RAG) is one of the best (and easiest) ways to specialize an LLM over your own data, but successfully applying RAG in practice involves more than just stitching together pretrained models.
Settling down in a new city (or codebase) is a marathon, not a sprint.
More and more of our lives are becoming data-driven. Is that a good thing?
Here’s a simple, three-part framework that explains generative language models.
The key strategies for building a headache-free data platform.
This new LLM technique has started improving the results of models without additional training.
Why replacing programmers with AI won’t be so easy.
Everyone who says "tech debt" assumes they know what we’re all talking about, but their individual definitions differ quite a bit.
With all the advancements in software development, apps could be much better. Why aren't they?
What matters isn’t just whether you use it, but how.
Retrieval-augmented generation (RAG) is a strategy that helps address both LLM hallucinations and out-of-date training data.
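The core loop is simple to sketch: retrieve documents relevant to the question, then ask the model to answer from that context. The `retriever` and `llm` objects below are hypothetical interfaces, not any specific library.

```python
def answer_with_rag(question, retriever, llm, top_k=3):
    """Retrieve, then generate: ground the answer in fetched documents (sketch)."""
    docs = retriever.search(question, top_k=top_k)  # assumed retriever interface
    context = "\n\n".join(doc.text for doc in docs)  # assumed .text attribute
    prompt = (
        "Answer the question using only the context below.\n\n"
        f"Context:\n{context}\n\n"
        f"Question: {question}"
    )
    return llm.generate(prompt)  # assumed LLM interface
```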
What exactly is a vector database? And how does it relate to generative AI?
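At its core, a vector database answers nearest-neighbor queries over embeddings. This brute-force NumPy sketch shows the basic operation; production systems layer approximate indexes such as HNSW on top to make it fast at scale.

```python
import numpy as np

def top_k_similar(query, index, k=5):
    """Brute-force cosine-similarity search over an [n, d] embedding matrix."""
    norms = np.linalg.norm(index, axis=1) * np.linalg.norm(query)
    sims = (index @ query) / np.maximum(norms, 1e-12)  # avoid divide-by-zero
    return np.argsort(-sims)[:k]  # indices of the k most similar rows
```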
It’s easy to ask for, and even want, feedback in a sort of theoretical sense. But soliciting and responding to feedback are, themselves, skills.