Posts

"And what matters is if it works."

August 22, 2023

To evaluate the performance of LLMs, we have to extend our observations & analysis well beyond the raw outputs.

treating information as atomic

52% are incorrect or 52% contain inaccuracies?

August 22, 2023

What does 'incorrect' even mean when referring to an LLM output?

incorrect treating information as atomic

"We also asked participants about their ChatGPT expertise"

August 21, 2023

Excerpt from a study on programming expertise showing participants self-reported as mostly proficient programmers but only competent in ChatGPT usage, revealing a skill gap between traditional coding and AI tool proficiency.

expertise survey-questions

Heteromation astride 'automation v. augmentation'

August 21, 2023

Brief post positioning heteromation concept alongside the automation versus augmentation debate, referencing Ekbia's work.

How do people perceive and perform-with tool outputs?

August 21, 2023

To better understand and address tool use, we need to understand not 'accuracy' but interaction.

results-of-search evaluating-results-meta

Rethinking Rethinking Search: Making Domain Experts out of Dilettantes

August 21, 2023

Analysis of Metzler et al.'s 'Rethinking Search' paper and its framing of LLMs as dilettantes, reframing the concept to consider how human dilettantes might interact with or as domain experts in search practice.

a toothpick, a bowl of pudding, a full glass of water, and a marshmallow

August 10, 2023

ChatGPT-4 seems able to now adequately address this specific stacking challenge?

prompt engineering custom instructions

What if general-purpose web search were more like searching for directions?

August 7, 2023

Thinking about what it would mean if general-purpose web search were more like searching for directions...

extending search support

"Google the Gatekeeper: How Search Components Affect Clicks and Attention"

August 7, 2023

search-audits ten-blue-links

Are prompts—& queries—not Lipschitz?

August 4, 2023

A tweet from @zacharylipton: Prompts are not Lipschitz. There are no “small” changes to prompts...

prompt engineering

Data Voids and the Google This Ploy

August 4, 2023

Thinking about comparing and contrasting 'prompt injection' and the 'Google This Ploy' (Caulfield, 2019)

prompt injection search directives data voids

But how do we ground our relations to hallucination?

August 3, 2023

Thinking about hallucination with Klosterman, Leahu, Munk et al., Rettberg, and Powles & Nissenbaum.

hallucination surprise

Keyword search is dead?

August 3, 2023

keyword search hallucination full questions automation bias opening-closing opacity musingful-memo

"However, we often consult Google about topics we might not know enough about..."

August 3, 2023

Meno-Paradox

"For if a reader goes to a book searching for new ideas..."

August 3, 2023

Meno-Paradox

OWASP Top 10 for Large Language Model Applications

August 3, 2023

Here is the 'OWASP Top 10 for Large Language Model Applications'. Overreliance is relevant to my research. (I’ve generally used the term “automation bias”, though perhaps a more direct term like overr...

automation-bias decoupling spaces-for-evaluation prompt-injection inadequate-informing Meno-Paradox

hints to jump to search

August 2, 2023

Logan Kilpatrick just announced search in OpenAI's developer docs so I glanced at it and saw the hint to jump to the search bar (their DocSearch-Button-Keys): ⌘ & K. I appreciate such micro interactio...

micro interactions in search

My academic search kit

August 2, 2023

Collection of tweets describing academic search workflow using Twitter, Semantic Scholar, local whoosh index, and various citation tools while exploring alternatives to Google Scholar.

academic-search

academic search tools

August 2, 2023

This is an incomplete listing of a few academic search tools. See also: - my paper w/ Jake Goldenfein (Google Scholar – Platforming the scholarly economy (goldenfein2022platforming)) - my scholar prof...

Google Scholar Elicit Consensus Internet Archive Scholar Open Alex academic-search

"A good search engine advises its users..."

July 31, 2023
