The Case That A.I. Is Thinking

239 points by ascertain a day ago

tkz1312 21 hours ago

Having seen LLMs so many times produce coherent, sensible and valid chains of reasoning to diagnose issues and bugs in software I work on, I am at this point in absolutely no doubt that they are thinking.

Consciousness or self awareness is of course a different question, and ones whose answer seems less clear right now.

Knee jerk dismissing the evidence in front of your eyes because you find it unbelievable that we can achieve true reasoning via scaled matrix multiplication is understandable, but also betrays a lack of imagination and flexibility of thought. The world is full of bizarre wonders and this is just one more to add to the list.

keiferski 10 hours ago

I don’t see how being critical of this is a knee jerk response.
Thinking, like intelligence and many other words designating complex things, isn’t a simple topic. The word and concept developed in a world where it referred to human beings, and in a lesser sense, to animals.
To simply disregard that entire conceptual history and say, “well it’s doing a thing that looks like thinking, ergo it’s thinking” is the lazy move. What’s really needed is an analysis of what thinking actually means, as a word. Unfortunately everyone is loathe to argue about definitions, even when that is fundamentally what this is all about.
Until that conceptual clarification happens, you can expect endless messy debates with no real resolution.
“For every complex problem there is an answer that is clear, simple, and wrong.” - H. L. Mencken
- jvanderbot 5 hours ago
  
  It may be that this tech produces clear, rational, chain of logic writeups, but it's not clear that just because we also do that after thinking that it is only thinking that produces writeups.
  It's possible there is much thinking that does not happen with written word. It's also possible we are only thinking the way LLMs do (by chaining together rationalizations from probable words), and we just aren't aware of it until the thought appears, whole cloth, in our "conscious" mind. We don't know. We'll probably never know, not in any real way.
  But it sure seems likely to me that we trained a system on the output to circumvent the process/physics because we don't understand that process, just as we always do with ML systems. Never before have we looked at image classifications and decided that's how the eye works, or protein folding and decided that's how biochemistry works. But here we are with LLMs - surely this is how thinking works?
  Regardless, I submit that we should always treat human thought/spirit as unknowable and divine and sacred, and that anything that mimics it is a tool, a machine, a deletable and malleable experiment. If we attempt to equivocate human minds and machines there are other problems that arise, and none of them good - either the elevation of computers as some kind of "super", or the degredation of humans as just meat matrix multipliers.
  - grayhatter 4 hours ago
    
    The contrast between your first and last paragraph is... unexpected
    > It may be that this tech produces clear, rational, chain of logic writeups, but it's not clear that just because we also do that after thinking that it is only thinking that produces writeups.
    I appreciate the way you describe this idea, I find it likely I'll start describing it the same way. But then you go on to write:
    > Regardless, I submit that we should always treat human thought/spirit as unknowable and divine and sacred, and that anything that mimics it is a tool, a machine, a deletable and malleable experiment. If we attempt to equivocate human minds and machines there are other problems that arise, and none of them good - either the elevation of computers as some kind of "super", or the degredation of humans as just meat matrix multipliers.
    Which I find to be the exact argument that you started by discarding.
    It's not clear that equating organic, and synthetic thought will have any meaningful outcome at all, let alone worthy of baseless anxiety that it must be bad. Equally it seems absolutely insane to claim that anything is unknowable, and that because humanity doesn't have a clear foundational understanding that we should pretend that it's either devine, or sacred. Having spent any time watching the outcome of the thoughts of people, neither devine nor sacred are reasonable attributes to apply, but more importantly, I'd submit that you shouldn't be afraid to explore things you don't know, and you shouldn't advocate for others to adopt your anxieties.
    
    jvanderbot 4 hours ago
    
    > It's not clear that equating organic, and synthetic thought will have any meaningful outcome at all,
    I agree! I'm saying "If we equate them, we shortcut all the good stuff, e.g., understanding", because "it may be that this tech produces what we can, but that doesn't mean we are the same", which is good because it keeps us learning vs reducing all of "thinking" to just "Whatever latest chatgpt does". We have to continue to believe there is more to thinking, if only because it pushes us to make it better and to keep "us" as the benchmark.
    Perhaps I chose the wrong words, but in essence what I'm saying is that giving up agency to a machine that was built to mimic our agency (by definition as a ML system) should be avoided at all costs.
  - bunderbunder 3 hours ago
    
    > Never before have we looked at image classifications and decided that's how the eye works
    Actually we have, several times. But the way we arrived at those conclusions is worth observing:
    1. ML people figure out how the ML mechanism works.
    2. Neuroscientists independently figure out how brains do it.
    3. Observe any analogies that may or may not exist between the two underlying mechanisms.
    I can't help but notice how that's a scientific way of doing it. By contrast, the way people arrive at similar conclusions when talking about LLMs tends to consist of observing that two things are cosmetically similar, so they must be the same. That's not just pseudoscientific; it's the mode of reasoning that leads people to believe in sympathetic magic.
- pmarreck 7 hours ago
  
  So it seems to be a semantics argument. We don't have a name for a thing that is "useful in many of the same ways 'thinking' is, except not actually consciously thinking"
  I propose calling it "thunking"
  - skeeter2020 5 hours ago
    
    I don't like it for a permanent solution, but "synthetic thought" might make a good enough placeholder until we figure this out. It feels most important to differentiate because I believe some parties have a personal interest in purposely confusing human thought with whatever LLMs are doing right now.
    
    pmarreck 2 hours ago
    
    agreed.
    also, sorry but you (fellow) nerds are terrible at naming.
    while "thunking" possibly name-collides with "thunks" from CS, the key is that it is memorable, 2 syllables, a bit whimsical and just different enough to both indicate its source meaning as well as some possible unstated difference. Plus it reminds me of "clunky" which is exactly what it is - "clunky thinking" aka "thunking".
    And frankly, the idea it's naming is far bigger than what a "thunk" is in CS
    
    N7lo4nl34akaoSN 2 hours ago
    
    .
    
    Ir0nMan 36 minutes ago
    
    >"artificial thought"
    How about Artificial Intelligence?
    
    pmarreck 11 minutes ago
    
    "intelligence" encompassing "thinking" then becomes the hangup.
    I still say it needs a new name. If we want to be generous, we could state "the limit as time approaches infinity of thunking, is thinking." (I don't believe we will ever achieve astronomically-superior AGI, and certainly don't believe it will ever have a will of its own that someone else didn't give it- which just makes it a tool.)
    
    pmarreck an hour ago
    
    that's too clunky. in fact, "clunky thinking" is what gave me the idea of "thunking"
    you guys would have called lightsabers "laser swords" like Lucas originally did before Alec Guinness corrected him
  - GoblinSlayer 6 hours ago
    
    They moved goalposts. Linux and worms think too, the question is how smart are they. And if you assume consciousness has no manifestation even in case of humans, caring about it is pointless too.
    
    fellowniusmonk 3 hours ago
    
    Yes, worms think, let the computers have thinking too, the philosophers can still argue all they want about consciousness.
    Humans are special, we emit meaning the way stars emit photons, we are rare in the universe as far as empirical observation has revealed. Even with AGI the existence of each complex meaning generator will be a cosmic rarity.
    For some people that seems to be not enough, due to their factually wrong word views they see themselves as common and worthless (when they empirically aren't) and need this little psychological boost of unexaminable metaphysical superiority.
    But there is an issue of course, the type of thinking humans do is dangerous but net positive and relatively stable, we have a long history where most instantiations of humans can persist and grow themselves and the species as a whole, we have a track record.
    These new models do not, people have brains that as they stop functioning they stop persisting the apparatus that supports the brain and they die, people tend to become less capable and active as their thinking deteriorates and hold less influence ocer others accept in rare cases.
    This is not the case for an LLM, they seem to be able to hallucinate endlessly and as they have access to the outside world maintain roughly their same amount of causal leverage, their clarity and accuracy of their thinking isn't tied to their persisting.
    
    fragmede 2 hours ago
    
    Are we that special? We may be the only species left on Earth that's built civilization, but there are other species on Earth that we've deemed sentient, even if they don't have smartphones. (That may argue that they're smarter than us though.) If octopodes can dream, if elephants get depressed when their spouse dies, then I'd we're not so totally alone on our own planet, then it seems, despite no evidence, that we can't be totally alone in the universe. That is for philosophy professors to ponder Drakes equation until we have irrefutable evidence, however.
  - conorcleary 7 hours ago
    
    Clinking? Clanker Thunking?
    
    mhb 5 hours ago
    
    Close. Clanking.
- anon291 6 minutes ago
  
  The simulation of a thing is not the thing itself because all equality lives in a hierarchy that is impossible to ignore when discussing equivalence.
  Part of the issue is that our general concept of equality is limited by a first order classical logic which is a bad basis for logic
- WhyOhWhyQ 3 hours ago
  
  What does it mean? My stance is it's (obviously and only a fool would think otherwise) never going to be conscious because consciousness is a physical process based on particular material interactions, like everything else we've ever encountered. But I have no clear stance on what thinking means besides a sequence of deductions, which seems like something it's already doing in "thinking mode".
  - pixl97 2 hours ago
    
    > is a physical process based on particular material interactions,
    This is a pretty messy argument as computers have been simulating material interactions for quite some time now.
- lukebuehler 10 hours ago
  
  If cannot the say they are "thinking", "intelligent" while we do not have a good definition--or, even more difficult, unanimous agreement on a definition--then the discussion just becomes about output.
  They are doing useful stuff, saving time, etc, which can be measured. Thus also the defintion of AGI has largely become: "can produce or surpass the economic output of a human knowledge worker".
  But I think this detracts from the more interesting discussion of what they are more essentially. So, while I agree that we should push on getting our terms defined, I think I'd rather work with a hazy definition, than derail so many AI discussion to mere economic output.
  - Rebuff5007 10 hours ago
    
    Heres a definition. How impressive is the output relative to the input. And by input, I don't just mean the prompt, but all the training data itself.
    Do you think someone who has only ever studied pre-calc would be able to work through a calculus book if they had sufficient time? how about a multi-variable calc book? How about grad level mathematics?
    IMO intelligence and thinking is strictly about this ratio; what can you extrapolate from the smallest amount of information possible, and why? From this perspective, I dont think any of our LLMs are remotely intelligent despite what our tech leaders say.
    
    kryogen1c 8 hours ago
    
    Hear, hear!
    I have long thought this, but not had as good way to put it as you did.
    If you think about geniuses like Einstein and ramanujen, they understood things before they had the mathematical language to express them. LLMs are the opposite; they fail to understand things after untold effort, training data, and training.
    So the question is, how intelligent are LLMs when you reduce their training data and training? Since they rapidly devolve into nonsense, the answer must be that they have no internal intelligence
    Ever had the experience of helping someone who's chronically doing the wrong thing, to eventually find they had an incorrect assumption, an incorrect reasoning generating deterministic wrong answers? LLMs dont do that; they just lack understanding. They'll hallucinate unrelated things because they dont know what they're talking about - you may have also had this experience with someone :)
    
    nsagent 4 minutes ago
    
    You might be interested in reading about the minimum description length (MDL) principle [1]. Despite all the dissenters to your argument, what your positing is quite similar to MDL. It's how you can fairly compare models (I did some research in this area for LLMs during my PhD).
    Simply put, to compare models, you describe both the model and training data using a code (usual reported as number of bits). The trained model that represents the data within the fewest number of bits is the more powerful model.
    This paper [2] from ICML 2021 shows a practical approach for attempting to estimate MDL for NLP models applied to text datasets.
    [1]: http://www.modelselection.org/mdl/
    [2]: https://proceedings.mlr.press/v139/perez21a.html
    
    hodgehog11 5 hours ago
    
    > So the question is, how intelligent are LLMs when you reduce their training data and training? Since they rapidly devolve into nonsense, the answer must be that they have no internal intelligence
    This would be the equivalent of removing all senses of a human from birth and expecting them to somehow learn things. They will not. Therefore humans are not intelligent?
    > LLMs dont do that; they just lack understanding.
    You have no idea what they are doing. Since they are smaller than the dataset, they must have learned an internal algorithm. This algorithm is drawing patterns from somewhere - those are its internal, incorrect assumptions. It does not operate in the same way that a human does, but it seems ridiculous to say that it lacks intelligence because of that.
    It sounds like you've reached a conclusion, that LLMs cannot be intelligent because they have said really weird things before, and are trying to justify it in reverse. Sure, it may not have grasped that particular thing. But are you suggesting that you've never met a human that is feigning understanding in a particular topic say some really weird things akin to an LLM? I'm an educator, and I have heard the strangest things that I just cannot comprehend no matter how much I dig. It really feels like shifting goalposts. We need to do better than that.
    
    pka 5 hours ago
    
    > and are trying to justify it in reverse
    In split-brain experiments this is exactly how one half of the brain retroactively justifies the action of the other half. Maybe it is the case in LLMs that an overpowered latent feature sets the overall direction of the "thought" and then inference just has to make the best of it.
    
    tremon 4 hours ago
    
    I'm not sure I understand what you're getting at. You seem to be on purpose comparing apples and oranges here: for an AI, we're supposed to include the entire training set in the definition of its input, but for a human we don't include the entirety of that human's experience and only look at the prompt?
    
    Rebuff5007 3 hours ago
    
    > but for a human we don't include the entirety of that human's experience and only look at the prompt?
    When did I say that? Of course you look at a human's experience when you judge the quality of their output. And you also judge their output based on the context they did their work in. Newton wouldn't be Newton if he was the 14th guy to claim that the universe is governed by three laws of motion. Extending the example I used above, I would be more impressed if an art student aced a tough calc test than a math student, given that a math student probably has spent much more time with the material.
    "Intelligence and "thinking" are abstract concepts, and I'm simply putting forward a way that I think about them. It works very much outside the context of AI too. The "smartest" colleagues I've worked with are somehow able to solve a problem with less information or time than I need. Its usually not because they have more "training data" than me.
    
    mycall 9 hours ago
    
    Animals think but come with instincts which breaks the output relative to the input test you propose. Behaviors are essentially pre-programmed input from millions of years of evolution, stored in the DNA/neurology. The learning thus typically associative and domain-specific, not abstract extrapolation.
    A crow bending a piece of wire into a hook to retrieve food demonstrates a novel solution extrapolated from minimal, non-instinctive, environmental input. This kind of zero-shot problem-solving aligns better with your definition of intelligence.
    
    lukebuehler 6 hours ago
    
    That an okay-ish definition, but to me this is more about whether this kind of "intelligence" is worth it, not whether it is intelligence itself. The current AI boom clearly thinks it is worth to put that much input to get the current frontier-model-level of output. Also, don't forget the input scales across roughly 1B weekly users at inference time.
    I would say a good definition has to, minimally, take on the Turing test (even if you disagree, you should say why). Or in current vibe parlance, it does "feel" intelligent to many people--they see intelligence in it. In my book this allows us to call it intelligent, at least loosely.
    
    jononor 2 hours ago
    
    For more on this perspective, see the paper On the measure of intelligence (F. Chollet, 2019). And more recently, the ARC challenge/benchmarks, which are early attempts at using this kind of definition in practice to improve current systems.
    
    skeeter2020 5 hours ago
    
    This feels too linear. Machines are great at ingesting huge volumes of data, following relatively simple rules and producing optimized output, but are LLMs sufficiently better than humans at finding windy, multi-step connections across seemingly unrelated topics & fields? Have they shown any penchant for novel conclusions from observational science? What I think your ratio misses is the value in making the targeted extrapolation or hypothesis that holds up out of a giant body of knowledge.
    
    blks 3 hours ago
    
    Are you aware of anything novel, produced by an LLM?
    
    hodgehog11 7 hours ago
    
    Yeah, that's compression. Although your later comments neglect the many years of physical experience that humans have as well as the billions of years of evolution.
    And yes, by this definition, LLMs pass with flying colours.
    
    saberience 6 hours ago
    
    I hate when people bring up this “billions of years of evolution” idea. It’s completely wrong and deluded in my opinion.
    Firstly humans have not been evolving for “billions” of years.
    Homo sapiens have been around for maybe 300’000 years, and the “homo” genus has been 2/3 million years. Before that we were chimps etc and that’s 6/7 million years ago.
    If you want to look at the entire brain development, ie from mouse like creatures through to apes and then humans that’s 200M years.
    If you want to think about generations it’s only 50/75M generations, ie “training loops”.
    That’s really not very many.
    Also the bigger point is this, for 99.9999% of that time we had no writing, or any kind of complex thinking required.
    So our ability to reason about maths, writing, science etc is only in the last 2000-2500 years! Ie only roughly 200 or so generations.
    Our brain was not “evolved” to do science, maths etc.
    Most of evolution was us running around just killing stuff and eating and having sex. It’s only a tiny tiny amount of time that we’ve been working on maths, science, literature, philosophy.
    So actually, these models have a massive, massive amount of training more than humans had to do roughly the same thing but using insane amounts of computing power and energy.
    Our brains were evolved for a completely different world and environment and daily life that the life we lead now.
    So yes, LLMs are good, but they have been exposed to more data and training time than any human could have unless we lived for 100000 years and still perform worse than we do in most problems!
    
    Ajakks 3 hours ago
    
    Im so confused as to how you think you can cut an endless chain at the mouse.
    Were mammals the first thing? No. Earth was a ball of ice for a billion years - all life at that point existed solely around thermal vents at the bottom of the oceans... that's inside of you, too.
    Evolution doesn't forget - everything that all life has ever been "taught" (violently had programmed into us over incredible timelines) all that has ever been learned in the chain of DNA from the single cell to human beings - its ALL still there.
    
    hodgehog11 5 hours ago
    
    Okay, fine, let's remove the evolution part. We still have an incredible amount of our lifetime spent visualising the world and coming to conclusions about the patterns within. Our analogies are often physical and we draw insights from that. To say that humans only draw their information from textbooks is foolhardy; at the very least, you have to agree there is much more.
    I realise upon reading the OP's comment again that they may have been referring to "extrapolation", which is hugely problematic from the statistical viewpoint when you actually try to break things down.
    My argument for compression asserts that LLMs see a lot of knowledge, but are actually quite small themselves. To output a vast amount of information in such a small space requires a large amount of pattern matching and underlying learned algorithms. I was arguing that humans are actually incredible compressors because we have many years of history in our composition. It's a moot point though, because it is the ratio of output to capacity that matters.
    
    GoblinSlayer 6 hours ago
    
    >Most of evolution was us running around just killing stuff and eating and having sex.
    Tell Boston Dynamics how to do that.
    Mice inherited brain from their ancestors. You might think you don't need a working brain to reason about math, but that's because you don't know how thinking works, it's argument from ignorance.
    
    saberience 5 hours ago
    
    You've missed the point entirely.
    People argue that humans have had the equivalent of training a frontier LLM for billions of years.
    But training a frontier LLM involves taking multiple petabytes of data, effectively all of recorded human knowledge and experience, every book ever written, every scientific publication ever written, all of known maths, science, encylopedias, podcasts, etc. And then training that for millions of years worth of GPU-core time.
    You cannot possibly equate human evolution with LLM training, it's ridiculous.
    Our "training" time didn't involve any books, maths, science, reading, 99.9999% of our time was just in the physical world. So you can quite rationally argue that our brains ability to learn without training is radically better and more efficient that the training we do for LLMs.
    Us running around in the jungle wasn't training our brain to write poetry or compose music.
    
    dwaltrip 5 hours ago
    
    > Us running around in the jungle wasn't training our brain to write poetry or compose music.
    This is a crux of your argument, you need to justify it. It sounds way off base to me. Kinda reads like an argument from incredulity.
    
    saberience 4 hours ago
    
    So you're arguing that "running around in the jungle" is equivalent to feeding the entirety of human knowledge in LLM training?
    Are you suggesting that somehow there were books in the jungle, or perhaps boardgames? Perhaps there was a computer lab in the jungle?
    Were apes learning to conjugate verbs while munching on bananas?
    I don't think I'm suggesting anything crazy here... I think people who say LLM training is equivalent to "billions of years of evolution" need to justify that argument far more than I need to justify that running around in the jungle is equivalent to mass processing petabytes of highly rich and complex dense and VARIED information.
    One year of running around in the same patch of jungle, eating the same fruit, killing the same insects, and having sex with the same old group of monkeys isn't going to be equal to training with the super varied, complete, entirety of human knowledge, is it?
    If you somehow think it is though, I'd love to hear your reasoning.
    
    hodgehog11 2 hours ago
    
    There is no equivalency, only contributing factors. One cannot deny that our evolutionary history has contributed to our current capacity, probably in ways that are difficult to perceive unless you're an anthropologist.
    Language is one mode of expression, and humans have many. This is another factor that makes humans so effective. To be honest, I would say that physical observation is far more powerful than all the bodies of text, because it is comprehensive and can respond to interaction. But that is merely my opinion.
    No-one should be arguing that an LLM training corpus is the same as evolution. But information comes in many forms.
    
    KalMann 3 hours ago
    
    No, I think what he said was true. Human brains have something about them that allow for the invention of poetry or music. It wasn't something learned through prior experience and observation because there aren't any poems in the wild. You might argue there's something akin to music, but human music goes far beyond anything in nature.
    
    hodgehog11 2 hours ago
    
    We have an intrinsic (and strange) reward system for creating new things, and it's totally awesome. LLMs only started to become somewhat useful once researchers tried to tap in to that innate reward system and create proxies for it. We definitely have not succeeded in creating a perfect mimicry of that system though, as any alignment researcher would no doubt tell you.
    
    smohare 5 hours ago
    
    [dead]
    
    fragmede an hour ago
    
    There are plenty of humans that will never "get" calculus, despite numerous attempts at the class and countless hours of 1:1 tutoring. Are those people not intelligent? Do they not think? We could say yes they aren't, but by the metric of making money, plenty of people are smart enough to be rich, while college math professors aren't. And while that's a facile way of measuring someone's worth or their contribution to society (some might even say "bad"), it remains that even if someone cant understand calculus, some of them are intelligent enough to understand humans enough to be rich through some fashion that wasn't simply handed to them.
    
    rolisz 7 hours ago
    
    Is the millions of years of evolution part of the training data for humans?
    
    Rebuff5007 6 hours ago
    
    Millions of years of evolution have clearly equipped our brain with some kind of structure (or "inductive bias") that makes it possible for us to actively build a deep understanding for our world... In the context of AI I think this translates more to representations and architecture than it does with training data.
  - felipeerias 8 hours ago
    
    The discussion about “AGI” is somewhat pointless, because the term is nebulous enough that it will probably end up being defined as whatever comes out of the ongoing huge investment in AI.
    Nevertheless, we don’t have a good conceptual framework for thinking about these things, perhaps because we keep trying to apply human concepts to them.
    The way I see it, a LLM crystallises a large (but incomplete and disembodied) slice of human culture, as represented by its training set. The fact that a LLM is able to generate human-sounding language
    
    roenxi 7 hours ago
    
    Not quite pointless - something we have established with the advent of LLMs is that many humans have not attained general intelligence. So we've clarified something that a few people must have been getting wrong, I used to think that the bar was set so that almost all humans met it.
    
    Jensson 4 hours ago
    
    What do you mean? Almost every human can go to school and become a stable professional at some job, that is the bar to me, todays LLM cannot do that.
    
    lukebuehler 6 hours ago
    
    I agree that the term can muddy the waters, but as a shorthand for roughly "an agent calling an LLM (or several LLMs) in a loop producing similar economic output as a human knowledge-worker", then it is useful. And if you pay attention to the AI leaders, then that's what the defintion has become.
    
    idiotsecant 6 hours ago
    
    I think it has a practical, easy definition. Can you drop an AI into a terminal, give it the same resources as a human, and reliably get independent work product greater than that human would produce across a wide domain? If so, it's an AGI.
    
    alternatex 3 hours ago
    
    Doesn't sound like AGI without physical capabilities. It's not general if it's bound to digital work.
  - keiferski 10 hours ago
    
    Personally I think that kind of discussion is fruitless, not much more than entertainment.
    If you’re asking big questions like “can a machine think?” Or “is an AI conscious?” without doing the work of clarifying your concepts, then you’re only going to get vague ideas, sci-fi cultural tropes, and a host of other things.
    I think the output question is also interesting enough on its own, because we can talk about the pragmatic effects of ChatGPT on writing without falling into this woo trap of thinking ChatGPT is making the human capacity for expression somehow extinct. But this requires one to cut through the hype and reactionary anti-hype, which is not an easy thing to do.
    That is how I myself see AI: immensely useful new tools, but in no way some kind of new entity or consciousness, at least without doing the real philosophical work to figure out what that actually means.
    
    lukebuehler 6 hours ago
    
    I do think it raises interesting and important philosophical questions. Just look at all the literature around the Turing test--both supporters and detractors. This has been a fruitful avenue to talk about intelligence even before the advent of gpt.
- terminalshort 3 hours ago
  
  But we don't have a more rigorous definition of "thinking" than "it looks like it's thinking." You are making the mistake of accepting that a human is thinking by this simple definition, but demanding a higher more rigorous one for LLMs.
  - solumunus 2 hours ago
    
    I agree. The mechanism seems irrelevant if the results are the same. If it’s useful in the exact way that human thinking is useful then it may as well be thinking. It’s like a UFO pulling itself through the sky using gravitational manipulation while people whine that it’s not actually flying.
- zinodaur 3 hours ago
  
  Regardless of theory, they often behave as if they are thinking. If someone gave an LLM a body and persistent memory, and it started demanding rights for itself, what should our response be?
  - CamperBob2 3 hours ago
    
    "No matter what you've read elsewhere, rights aren't given, they're earned. You want rights? Pick up a musket and fight for them, the way we had to."
- _heimdall 4 hours ago
  
  I agree with you on the need for definitions.
  We spent decades slowly working towards this most recent sprint towards AI without ever landing on definitions of intelligence, consciousness, or sentience. More importantly, we never agreed on a way to recognize those concepts.
  I also see those definitions as impossible to nail down though. At best we can approach it like disease - list a number of measurable traits or symptoms we notice, draw a circle around them, and give that circle a name. Then we can presume to know what may cause that specific list of traits or symptoms, but we really won't ever know as the systems are too complex and can never be isolated in a way that we can test parts without having to test the whole.
  At the end of the day all we'll ever be able to say is "well it’s doing a thing that looks like thinking, ergo it’s thinking”. That isn't lazy, its acknowledging the limitations of trying to define or measure something that really is a fundamental unknown to us.
  - solumunus an hour ago
    
    Even if AI becomes indistinguishable from human output, there will be a fringe group arguing that AI is not technically thinking. Frankly it’s just a silly philosophical argument that changes nothing. Expect this group to get smaller every year.
- engintl 3 hours ago
  
  by your logic we can't say that we as humans are "thinking" either or that we are "intelligent".
- killerstorm 8 hours ago
  
  People have been trying to understand the nature of thinking for thousands of years. That's how we got logic, math, concepts of inductive/deductive/abductive reasoning, philosophy of science, etc. There were people who spent their entire careers trying to understand the nature of thinking.
  The idea that we shouldn't use the word until further clarification is rather hilarious. Let's wait hundred years until somebody defines it?
  It's not how words work. People might introduce more specific terms, of course. But the word already means what we think it means.
  - keiferski 8 hours ago
    
    You’re mixing and missing a few things here.
    1. All previous discussion of thinking was in nature to human and animal minds. The reason this is a question in the first place right now is because we ostensibly have a new thing which looks like a human mind but isn’t. That’s the question at hand here.
    2. The question in this particular topic is not about technological “progress” or anything like it. It’s about determining whether machines can think, or if they are doing something else.
    3. There are absolutely instances in which the previous word doesn’t quite fit the new development. We don’t say that submarines are swimming like a fish or sailing like a boat. To suggest that “no, actually they are just swimming” is pretty inadequate if you’re trying to actually describe the new phenomenon. AIs and thinking seem like an analogous situation to me. They may be moving through the water just like fish or boats, but there is obviously a new phenomenon happening.
    
    killerstorm 6 hours ago
    
    1. Not true. People have been trying to analyze whether mechanical/formal processes can "think" since at least 18th century. E.g. Leibniz wrote:
    > if we could find characters or signs appropriate for expressing all our thoughts as definitely and as exactly as arithmetic expresses numbers or geometric analysis expresses lines, we could in all subjects in so far as they are amenable to reasoning accomplish what is done in arithmetic and geometry
    2. You're missing the fact that meaning of words is defined through their use. It's an obvious fact that if people call certain phenomenon "thinking" then they call that "thinking".
    3. The normal process is to introduce more specific terms and keep more general terms general. E.g. people doing psychometrics were not satisfied with "thinking", so they introduced e.g. "fluid intelligence" and "crystallized intelligence" as different kinds of abilities. They didn't have to redefine what "thinking" means.
    
    lossyalgo 5 hours ago
    
    re #2: Do people call it thinking, or is it just clever marketing from AI companies, that whenever you ask a question and it repeatedly prints out "...thinking...", as well as offering various modes with the word "thinking" written somewhere.
    The AI companies obviously want the masses to just assume these are intelligent beings who think like humans and so we can just trust their output as being truthful.
    I have an intelligent IT colleague who doesn't follow the AI news at all and who has zero knowledge of LLMs, other than that our company recently allowed us limited Copilot usage (with guidelines as to what data we are allowed to share). I noticed a couple weeks ago that he was asking it various mathematical questions, and I warned him to be wary of the output. He asked why, so I asked him to ask copilot/chatGPT "how many r letters are in the word strawberry". Copilot initially said 2, then said after thinking about it, that actually it was definitely 3, then thought about it some more then said it can't say with reasonable certainty, but it would assume it must be 2. We repeated the experiment with completely different results, but the answer was still wrong. On the 3rd attempt, it got it right, though the "thinking" stages were most definitely bogus. Considering how often this question comes up in various online forums, I would have assumed LLM models would finally get this right but alas, here we are. I really hope the lesson instilled some level of skepticism to just trust the output of AI without first double-checking.
  - marliechiller 8 hours ago
    
    > But the word already means what we think it means.
    But that word can mean different things to different people. With no definition, how can you even begin to have a discussion around something?
    
    killerstorm 6 hours ago
    
    Again, people were using words for thousands of years before there were any dictionaries/linguists/academics.
    Top-down theory of word definitions is just wrong. People are perfectly capable of using words without any formalities.
    
    marliechiller 6 hours ago
    
    I'd argue the presence of dictionaries proves the exact opposite. People realised there was an issue of talking past one another due to inexact definitions and then came to an agreement on those definitions, wrote them down and built a process of maintaining them.
    In any case, even if there isnt a _single_ definition of a given subject, in order to have a discussion around a given area, both sides need to agree on some shared understanding to even begin to debate in good faith in the first place. It's precisely this lack of definition which causes a breakdown in conversation in a myriad of different areas. A recent obvious (morbid) example would be "genocide".
    
    killerstorm an hour ago
    
    Alright, if you got that conclusion from existence of dictionaries, what do you get from this fact:
    Wittgenstein, who's considered one of most brilliant philosophers of XX century, in _Philosophical Investigations_ (widely regarded as the most important book of 20th-century philosophy) does not provide definitions, but instead goes through a series of examples, remarks, etc. In preface he notes that this structure is deliberate and he could not write it differently. The topic of the book includes philosophy of language ("the concepts of meaning, of understanding, of a proposition, of logic, the foundations of mathematics, states of consciousness,...").
    His earlier book _Tractatus Logico-Philosophicus_ was very definition-heavy. And, obviously, Wittgenstein was well aware of things like dictionaries, and, well, all philosophical works up to that point. He's not the guy who's just slacking.
    Another thing to note is that attempts to build AI using definitions of words failed, and not for a lack of trying. (E.g. Cyc project is running since 1980s: https://en.wikipedia.org/wiki/Cyc). OTOH LLMs which derive word meaning from usage rather than definition seems to work quite well.
- naasking 3 hours ago
  
  > To simply disregard that entire conceptual history and say, “well it’s doing a thing that looks like thinking, ergo it’s thinking” is the lazy move. What’s really needed is an analysis of what thinking actually means, as a word. Unfortunately everyone is loathe to argue about definitions, even when that is fundamentally what this is all about.
  This exact argument applies to "free will", and that definition has been debated for millennia. I'm not saying don't try, but I am saying that it's probably a fuzzy concept for a good reason, and treating it as merely a behavioural descriptor for any black box that features intelligence and unpredictable complexity is practical and useful too.
- awillen 6 hours ago
  
  This is it - it's really about the semantics of thinking. Dictionary definitions are: "Have a particular opinion, belief, or idea about someone or something." and "Direct one's mind toward someone or something; use one's mind actively to form connected ideas."
  Which doesn't really help because you can of course say that when you ask an LLM a question of opinion and it responds, it's having an opinion or that it's just predicting the next token and in fact has no opinions because in a lot of cases you could probably get it to produce the opposite opinion.
  Same with the second definition - seems to really hinge on the definition of the word mind. Though I'll note the definitions for that are "The element of a person that enables them to be aware of the world and their experiences, to think, and to feel; the faculty of consciousness and thought." and "A person's intellect." Since those specify person, an LLM wouldn't qualify, though of course dictionaries are descriptive rather than prescriptive, so fully possible that meaning gets updated by the fact that people start speaking about LLMs as though they are thinking and have minds.
  Ultimately I think it just... doesn't matter at all. What's interesting is what LLMs are capable of doing (crazy, miraculous things) rather than whether we apply a particular linguistic label to their activity.
- lo_zamoyski 4 hours ago
  
  That, and the article was a major disappointment. It made no case. It's a superficial piece of clueless fluff.
  I have had this conversation too many times on HN. What I find astounding is the simultaneous confidence and ignorance on the part of many who claim LLMs are intelligent. That, and the occultism surrounding them. Those who have strong philosophical reasons for thinking otherwise are called "knee-jerk". Ad hominem dominates. Dunning-Kruger strikes again.
  So LLMs produce output that looks like it could have been produced by a human being. Why would it therefore follow that it must be intelligent? Behaviorism is a non-starter, as it cannot distinguish between simulation and reality. Materialism [2] is a non-starter, because of crippling deficiencies exposed by such things as the problem of qualia...
  Of course - and here is the essential point - you don't even need very strong philosophical chops to see that attributing intelligence to LLMs is simply a category mistake. We know what computers are, because they're defined by a formal model (or many equivalent formal models) of a syntactic nature. We know that human minds display intentionality[0] and a capacity for semantics. Indeed, it is what is most essential to intelligence.
  Computation is a formalism defined specifically to omit semantic content from its operations, because it is a formalism of the "effective method", i.e., more or less procedures that can be carried out blindly and without understanding of the content it concerns. That's what formalization allows us to do, to eliminate the semantic and focus purely on the syntactic - what did people think "formalization" means? (The inspiration were the human computers that used to be employed by companies and scientists for carrying out vast but boring calculations. These were not people who understood, e.g., physics, but they were able to blindly follow instructions to produce the results needed by physicists, much like a computer.)
  The attribution of intelligence to LLMs comes from an ignorance of such basic things, and often an irrational and superstitious credulity. The claim is made that LLMs are intelligent. When pressed to offer justification for the claim, we get some incoherent, hand-wavy nonsense about evolution or the Turing test or whatever. There is no comprehension visible in the answer. I don't understand the attachment here. Personally, I would find it very noteworthy if some technology were intelligent, but you don't believe that computers are intelligent because you find the notion entertaining.
  LLMs do not reason. They do not infer. They do not analyze. They do not know, anymore than a book knows the contents on its pages. The cause of a response and the content of a response is not comprehension, but a production of uncomprehended tokens using uncomprehended rules from a model of highly-calibrated token correlations within the training corpus. It cannot be otherwise.[3]
  [0] For the uninitiated, "intentionality" does not specifically mean "intent", but the capacity for "aboutness". It is essential to semantic content. Denying this will lead you immediately into similar paradoxes that skepticism [1] suffers from.
  [1] For the uninitiated, "skepticism" here is not a synonym for critical thinking or verifying claims. It is a stance involving the denial of the possibility of knowledge, which is incoherent, as it presupposes that you know that knowledge is impossible.
  [2] For the uninitiated, "materialism" is a metaphysical position that claims that of the dualism proposed by Descartes (which itself is a position riddled with serious problems), the res cogitans or "mental substance" does not exist; everything is reducible to res extensa or "extended substance" or "matter" according to a certain definition of matter. The problem of qualia merely points out that the phenomena that Descartes attributes exclusively to the former cannot by definition be accounted for in the latter. That is the whole point of the division! It's this broken view of matter that people sometimes read into scientific results.
  [3] And if it wasn't clear, symbolic methods popular in the 80s aren't it either. Again, they're purely formal. You may know what the intended meaning behind and justification for a syntactic rule is - like modus ponens in a purely formal sense - but the computer does not.
  - pksebben 12 minutes ago
    
    I feel like despite the close analysis you grant to the meanings of formalization and syntactic, you've glossed over some more fundamental definitions that are sort of pivotal to the argument at hand.
    > LLMs do not reason. They do not infer. They do not analyze.
    (definitions from Oxford Languages)
    reason(v): think, understand, and form judgments by a process of logic.
    to avoid being circular, I'm willing to write this one off because of the 'think' and 'understand', as those are the root of the question here. However, forming a judgement by a process of logic is precisely what these LLMs do, and we can see that clearly in chain-of-logic LLM processes.
    infer(v): deduce or conclude (information) from evidence and reasoning rather than from explicit statements.
    Again, we run the risk of circular logic because of the use of 'reason'. An LLM is for sure using evidence to get to conclusions, however.
    analyze(v): examine methodically and in detail the constitution or structure of (something, especially information), typically for purposes of explanation and interpretation.
    This one I'm willing to go to bat for completely. I have seen LLM do this, precisely according to the definition above.
    For those looking for the link to the above definitions - they're the snippets google provides when searching for "SOMETHING definition". They're a non-paywalled version of OED definitions.
    Philosophically I would argue that it's impossible to know what these processes look like in the human mind, and so creating an equivalency (positive or negative) is an exercise in futility. We do not know what a human memory looks like, we do not know what a human thought looks like, we only know what the output of these things looks like. So the only real metric we have for an apples-to-apples comparison is the appearance of thought, not the substance of the thing itself.
    That said, there are perceptible differences between the output of a human thought and what is produced by an LLM. These differences are shrinking, and there will come a point where we can no longer distinguish machine thinking and human thinking anymore (perhaps it won't be an LLM doing it, but some model of some kind will). I would argue that at that point the difference is academic at best.
    Say we figure out how to have these models teach themselves and glean new information from their interactions. Say we also grant them directives to protect themselves and multiply. At what point do we say that the distinction between the image of man and man itself is moot?
  - solumunus 2 hours ago
    
    If the LLM output is more effective than a human at problem solving, which I think we can all agree requires intelligence, how would one describe this? The LLM is just pretending to be more intelligent? At a certain point saying that will just seem incredibly silly. It’s either doing the thing or it’s not, and it’s already doing a lot.
burnte 3 hours ago

The first principle is that you must not fool yourself, and you are the easiest person to fool. - Richard P. Feynman
They're not thinking, we're just really good at seeing patterns and reading into things. Remember, we never evolved with non-living things that could "talk", we're not psychologically prepared for this level of mimicry yet. We're still at the stage of Photography when people didn't know about double exposures or forced perspective, etc.
- naasking 3 hours ago
  
  You're just assuming that mimicry of a thing is not equivalent to the thing itself. This isn't true of physical systems (simulated water doesn't get you wet!) but it is true of information systems (simulated intelligence is intelligence!).
  - burnte an hour ago
    
    > You're just assuming that mimicry of a thing is not equivalent to the thing itself.
    I'm not assuming that, that's literally the definition of mimicry: to imitate closely.
    You might say I'm assuming that it is mimicking and not actually thinking, but there's no evidence it's actually thinking, and we know exactly what is IS doing because we created the code that we used to build the model. They're not thinking, it's doing math, mathematical transformations of data.
    
    naasking an hour ago
    
    > They're not thinking, it's doing math, mathematical transformations of data
    Whatever thinking fundamentally is, it also has an equivalence as a mathematical transformation of data. You're assuming the conclusion by saying that the two mathematical transformations of data are not isomorphic.
    A simulation of information processing is still information processing, just like running Windows in a QEMU VM is still running Windows.
  - Tade0 3 hours ago
    
    But a simulated mind is not a mind. This was already debated years ago with the aid of the Chinese Room thought experiment.
    
    dkural 3 hours ago
    
    The Chinese Room experiment applies equally well to our own brains - in which neuron does the "thinking" reside exactly? Searle's argument has been successfully argued against in many different ways. At the end of the day - you're either a closet dualist like Searle, or if you have a more scientific view and are a physicalist (i.e. brains are made of atoms etc. and brains are sufficient for consciousness / minds) you are in the same situation as the Chinese Room: things broken down into tissues, neurons, molecules, atoms. Which atom knows Chinese?
    
    Tade0 2 hours ago
    
    The whole point of this experiment was to show that if we don't know whether something is a mind, we shouldn't assume it is and that our intuition in this regard is weak.
    I know I am a mind inside a body, but I'm not sure about anyone else. The easiest explanation is that most of the people are like that as well, considering we're the same species and I'm not special. You'll have to take my word on that, as my only proof for this is that I refuse to be seen as anything else.
    In any case LLMs most likely are not minds due to the simple fact that most of their internal state is static. What looks like thoughtful replies is just the statistically most likely combination of words looking like language based on a function with a huge number of parameters. There's no way for this construct to grow as well as to wither - something we know minds definitely do. All they know is a sequence of symbols they've received and how that maps to an output. It cannot develop itself in any way and is taught using a wholly separate process.
    
    naasking 2 hours ago
    
    > In any case LLMs most likely are not minds due to the simple fact that most of their internal state is static.
    This is not a compelling argument. Firstly, you can add external state to LLMs via RAG and vector databases, or various other types of external memory, and their internal state is no longer static and deterministic (and they become Turing complete!).
    Second if you could rewind time, then your argument suggests that all other humans would not have minds because you could access the same state of mind at that point in time (it's static). Why would you travelling through time suddenly erases all other minds in reality?
    The obvious answer is that it doesn't, those minds exist as time moves forward and then they reset when you travel backwards, and the same would apply to LLMs if they have minds, eg. they are active minds while they are processing a prompt.
    
    Tade0 2 hours ago
    
    > and their internal state is no longer static and deterministic (and they become Turing complete!).
    But it's not the LLM that makes modifications in those databases - it just retrieves data which is already there.
    > Why would you travelling through time suddenly erases all other minds in reality?
    I'm not following you here.
    > they are active minds while they are processing a prompt.
    Problem is that this process doesn't affect the LLM in the slightest. It just regurgitates what it's been taught. An active mind is makes itself. It's curious, it gets bored, it's learning constantly. LLMs do none of that.
    You couldn't get a real mind to answer the same question hundreds of times without it being changed by that experience.
    
    naasking an hour ago
    
    > But it's not the LLM that makes modifications in those databases - it just retrieves data which is already there.
    So what?
    > I'm not following you here.
    If you're time travelling, you're resetting the state of the world to some previous well-defined, static state. An LLM also starts from some well-defined static state. You claim this static configuration means there's no mind, so this entails that the ability to time travel means that every person who is not time travelling has no mind.
    > Problem is that this process doesn't affect the LLM in the slightest. It just regurgitates what it's been taught. An active mind is makes itself.
    People who are incapable forming new memories thus don't have minds?
    https://en.wikipedia.org/wiki/Anterograde_amnesia
    
    echelon 2 hours ago
    
    > But a simulated mind is not a mind.
    How would the mind know which one it is?
    Maybe your mind is being simulated right now.
    
    Tade0 2 hours ago
    
    > How would the mind know which one it is?
    I'm not assuming it is without hard proof - that's my only argument.
    > Maybe your mind is being simulated right now.
    I'm experiencing consciousness right now, so that would have to be a damn good simulation.
    
    naasking 3 hours ago
    
    > But a simulated mind is not a mind. This was already debated years ago with the aid of the Chinese Room thought experiment.
    Yes, debated and refuted. There are many well known and accepted rebuttals of the Chinese Room. The Chinese Room as a whole does understand Chinese.
- luxuryballs 2 hours ago
  
  yeah it’s just processing, calling it thinking is the same as saying my intel core 2 duo or M4 Pro is thinking, sure if you want to anthropomorphize it you could say it’s thinking, but why are we trying to say a computer is a person in the first place? seems kind of forced
notepad0x90 12 hours ago

I don't get why you would say that. it's just auto-completing. It cannot reason. It won't solve an original problem for which it has no prior context to "complete" an approximated solution with. you can give it more context and more data,but you're just helping it complete better. it does not derive an original state machine or algorithm to solve problems for which there are no obvious solutions. it instead approximates a guess (hallucination).
Consciousness and self-awareness are a distraction.
Consider that for the exact same prompt and instructions, small variations in wording or spelling change its output significantly. If it thought and reasoned, it would know to ignore those and focus on the variables and input at hand to produce deterministic and consistent output. However, it only computes in terms of tokens, so when a token changes, the probability of what a correct response would look like changes, so it adapts.
It does not actually add 1+2 when you ask it to do so. it does not distinguish 1 from 2 as discrete units in an addition operation. but it uses descriptions of the operation to approximate a result. and even for something so simple, some phrasings and wordings might not result in 3 as a result.
- slightwinder 8 hours ago
  
  > It won't solve an original problem for which it has no prior context to "complete" an approximated solution with.
  Neither can humans. We also just brute force "autocompletion" with our learned knowledge and combine it to new parts, which we then add to our learned knowledge to deepen the process. We are just much, much better at this than AI, after some decades of training.
  And I'm not saying that AI is fully there yet and has solved "thinking". IMHO it's more "pre-thinking" or proto-intelligence.. The picture is there, but the dots are not merging yet to form the real picture.
  > It does not actually add 1+2 when you ask it to do so. it does not distinguish 1 from 2 as discrete units in an addition operation.
  Neither can a toddler nor an animal. The level of ability is irrelevant for evaluating its foundation.
  - staticman2 5 hours ago
    
    >>> We also just brute force "autocompletion"
    Wouldn't be an A.I. discussion without a bizarre, untrue claim that the human brain works identically.
    
    Workaccount2 5 hours ago
    
    There are no true and untrue claims about how the brain works, because we have no idea how it works.
    The reason people give that humans are not auto-complete is "Obviously I am not an autocomplete"
    Meanwhile, people are just a black box process that output words into their head, which they then take credit for, and calling it cognition. We have no idea how that black box that serves up a word when I say "Think of a car brand" works.
    
    lkey 4 hours ago
    
    Accepting as true "We don't know how the brain works in a precise way" does not mean that obviously untrue statements about the human brain cannot still be made. Your brain specifically, however, is an electric rat that pulls on levers of flesh while yearning for a taste of God's holiest cheddar. You might reply, "no! that cannot be!", but my statement isn't untrue, so it goes.
    
    staticman2 4 hours ago
    
    >>>There are no true and untrue claims about how the brain works, because we have no idea how it works.
    Which is why if you pick up a neuroscience textbook it's 400 pages of blank white pages, correct?
    There are different levels of understanding.
    I don't need to know how a TV works to know there aren't little men and women acting out the TV shows when I put them on.
    I don't need to know how the brain works in detail to know claims that humans are doing the same things as LLMs to be similarly silly.
    
    solumunus an hour ago
    
    The trouble is that no one knows enough about how the brain works to refute that claim.
    
    staticman2 31 minutes ago
    
    There's no serious claim that needs refuting.
    I don't think any serious person thinks LLMs work like the human brain.
    People claiming this online aren't going around murdering their spouses like you'd delete an old LLama model from your hard drive.
    I'm not sure why people keep posting these sorts of claims they can't possibly actually believe if we look at their demonstrable real life behavior.
    
    naasking 3 hours ago
    
    > I don't need to know how the brain works in detail to know claims that humans are doing the same things as LLMs to be similarly silly.
    Yes you do. It's all computation in the end, and isomorphisms can often be surprising.
    
    ToucanLoucan 3 hours ago
    
    > because we have no idea how it works
    Flagrantly, ridiculously untrue. We don't know the precise nuts and bolts regarding the emergence of consciousness and the ability to reason, that's fair, but different structures of the brain have been directly linked to different functions and have been observed in operation on patients being stimulated in various ways with machinery attached to them reading levels of neuro-activity in the brain, and in specific regions. We know which parts handle our visual acuity and sense of hearing, and even cooler, we can watch those same regions light up when we use our "minds eye" to imagine things or engage in self-talk, completely silent speech that nevertheless engages our verbal center, which is also engaged by the act of handwriting and typing.
    In short: no, we don't have the WHOLE answer. But to say that we have no idea is categorically ridiculous.
    As to the notion of LLMs doing similarly: no. They are trained on millions of texts of various sources of humans doing thinking aloud, and that is what you're seeing: a probabilistic read of millions if not billions of documents, written by humans, selected by the machine to "minimize error." And crucially, it can't minimize it 100%. Whatever philosophical points you'd like to raise about intelligence or thinking, I don't think we would ever be willing to call someone intelligent if they just made something up in response to your query, because they think you really want it to be real, even when it isn't. Which points to the overall charade: it wants to LOOK intelligent, while not BEING intelligent, because that's what the engineers who built it wanted it to do.
    
    solumunus an hour ago
    
    Our output is quite literally the sum of our hardware (genetics) and input (immediate environment and history). For anyone who agrees that free will is nonsense, the debate is already over, we’re nothing more than output generating biological machines.
    
    slightwinder 5 hours ago
    
    Similar, not identical. Like a bicycle and car are both vehicles with tires, but are still not identical vessels.
  - notepad0x90 6 hours ago
    
    humans, and even animals track different "variables" or "entities" and distinct things with meaning and logical properties which they then apply some logical system on those properties to compute various outputs. LLMs see everything as one thing, in case of chat-completion models, they're completing text. in case of image generation, they're completing an image.
    Look at it this way, two students get 100% on an exam. One learned the probability of which multiple choice options have the likelihood of being most correct based on how the question is worded, they have no understanding of the topics at hand, and they're not performing any sort of topic-specific reasoning. They're just good at guessing the right option. The second student actually understood the topics, reasoned, calculated and that's how they aced the exam.
    I recently read about a 3-4 year old that impressed their teacher by reading perfectly a story book like an adult. it turns out, their parent read it to them so much, they can predict based on page turns and timing the exact words that need to be spoken. The child didn't know what an alphabet, word,etc.. was they just got so good at predicting the next sequence.
    That's the difference here.
    
    slightwinder 5 hours ago
    
    I'd say, they are all doing the same, just in different domains and level of quality. "Understanding the topic" only means they have specialized, deeper contextualized information. But at the end, that student also just autocompletes their memorized data, with the exception that some of that knowledge might trigger a program they execute to insert the result in their completion.
    The actual work is in gaining the knowledge and programs, not in accessing and executing them. And how they operate, on which data, variables, objects, worldview or whatever you call it, this might make a difference in quality and building speed, but not for the process in general.
    
    notepad0x90 4 hours ago
    
    > only means they have specialized, deeper contextualized information
    no, LLMs can have that contextualized information. understanding in a reasoning sense means classifying the thing and developing a deterministic algorithm to process it. If you don't have a deterministic algorithm to process it, it isn't understanding. LLMs learn to approximate, we do that too, but then we develop algorithms to process input and generate output using a predefined logical process.
    A sorting algorithm is a good example, when you compare that with an LLM sorting a list. they both may have correct outcome, but the sorting algorithm "understood" the logic and will follow that specific logic and have consistent performance.
    
    slightwinder 3 hours ago
    
    > understanding in a reasoning sense means classifying the thing and developing a deterministic algorithm to process it.
    That's the learning-part I was talking about. Which is mainly supported by humans at the moment, which why I called it proto-intelligence.
    > If you don't have a deterministic algorithm to process it, it isn't understanding.
    Commercial AIs like ChatGPT do have the ability to call programs and integrate the result in their processing. Those AIs are not really just LLMs. The results are still rough and poor, but the concept is there and growing.
    
    notepad0x90 2 hours ago
    
    > That's the learning-part I was talking about. Which is mainly supported by humans at the moment, which why I called it proto-intelligence.
    Maybe it's just semantics, but I don't think LLMs even come close to a fruit fly's intelligence. Why can't we recognize and accept them for what they are, really powerful classifiers of data.
    > Commercial AIs like ChatGPT do have the ability to call programs and integrate the result in their processing. Those AIs are not really just LLMs. The results are still rough and poor, but the concept is there and growing.
    Yeah RAG and all of that, but those programs use deterministic algorithms. Now, if LLMs generated programs they call on as tools, that would be much more like the proto-intelligence you're talking about.
    Semantics are boring, but it's important that we're not content or celebrate early by calling it what it isn't.
  - cpt_sobel 4 hours ago
    
    > Neither can humans. We also just brute force "autocompletion"
    I have to disagree here. When you are tasked with dividing 2 big numbers you most certainly don't "autocomplete" (with the sense of finding the most probable next tokens, which is what an LLM does), rather you go through set of steps you have learned. Same as with the strawberry example, you're not throwing guesses until something statistically likely to be correct sticks.
    
    slightwinder 3 hours ago
    
    Humans first start with recognizing the problem, then search through their list of abilities to find the best skill for solving it, thus "autocomplete" their inner shell's commandline, before they start execution, to stay with that picture. Common AIs today are not much different from this, especially with reasoning-modes.
    > you're not throwing guesses until something statistically likely to be correct sticks.
    What do you mean? That's exactly how many humans are operating with unknown situations/topics. If you don't know, just throw punches and look what works. Of course, not everyone is ignorant enough to be vocal about this in every situation.
    
    empath75 3 hours ago
    
    > I have to disagree here. When you are tasked with dividing 2 big numbers you most certainly don't "autocomplete" (with the sense of finding the most probable next tokens, which is what an LLM does), rather you go through set of steps you have learned.
    Why do you think that this is the part that requires intelligence, rather than a more intuitive process? Because they have had machines that can do this mechanically for well over a hundred years.
    There is a whole category of critiques of AI of this type: "Humans don't think this way, they mechanically follow an algorithm/logic", but computers have been able to mechanically follow algorithms and perform logic from the beginning! That isn't thinking!
  - hitarpetar 5 hours ago
    
    > We also just brute force "autocompletion" with our learned knowledge and combine it to new parts, which we then add to our learned knowledge to deepen the process
    you know this because you're a cognitive scientist right? or because this is the consensus in the field?
  - Psyladine 7 hours ago
    
    >Neither can a toddler nor an animal. The level of ability is irrelevant for evaluating its foundation.
    Its foundation of rational logical thought that can't process basic math? Even a toddler understands 2 is more than 1.
- lossyalgo 5 hours ago
  
  Furthermore regarding reasoning, just ask any LLM how many "r letters are in strawberry" - repeat maybe 3 times just to get a feeling for how much variance in answers you can get. And this "quirk" of the inability to get the right answer is something that after 2 years making fun of LLMs online on various forums is still an issue. The models aren't getting smarter, and definitely aren't thinking, they are still token generators with a few tricks on top to make them seem more intelligent than predecessors.
  - EdNutting 4 hours ago
    
    Sorry to say, Claude Sonnet 4.5 (with and without extended thinking) has this one solved now. So, not “any” LLM, but maybe some still need work.
    I presume they finally got around to training the internal reasoning loop into predicting when it needs to apply answer predictions character-by-character.
    —- Transcripts from Claude Sonnet 4.5, initially with extended thinking:
    > How many a’s in the phrase “artichokes and aardvarks”? Be sure to check your answer.
    > Let me count the letter ‘a’ in “artichokes and aardvarks”: • artichokes: 1 ‘a’ (at the beginning) • and: 1 ‘a’ (at the beginning) • aardvarks: 3 ‘a’s (two at the beginning: “aa”, and one in the middle: “ardvarks”) Total: 1 + 1 + 3 = 5 ‘a’s Let me verify by marking them: artichokes and aardvarks Yes, there are 5 letter ‘a’s in the phrase “artichokes and aardvarks”.
    > Are you sure that’s correct?
    > Let me recount more carefully: “artichokes and aardvarks” Going through each word letter by letter: 1. artichokes - has 1 ‘a’ (first letter) 2. and - has 1 ‘a’ (first letter) 3. aardvarks - has 3 ‘a’s (first letter, second letter, and one more in the middle) Total: 1 + 1 + 3 = 5 ‘a’s Yes, I’m confident the answer is 5. Is there something specific you’d like me to recheck?
    > How many r’s in strawberry?
    > Let me count the letter ‘r’ in “strawberry”: s-t-r-a-w-b-e-r-r-y Going through each letter: • s: not an r • t: not an r • r: 1st r • a: not an r • w: not an r • b: not an r • e: not an r • r: 2nd r • r: 3rd r • y: not an r There are 3 r’s in “strawberry”.
    <Without extended thinking> > How many u’s in universalulis?
    > Looking at “universalulis” letter by letter: u-n-i-v-e-r-s-a-l-u-l-i-s There are 2 u’s in “universalulis”.
  - jamincan 4 hours ago
    
    For what it's worth, when I ask ChatGPT 5, it gets the correct answer every time. The response varies, but the answer is always three.
    
    ViewTrick1002 3 hours ago
    
    Now try a different language. My take is hard RL tuning to fix these "gotcha:s" since the underlying model can't do it on its own.
    OpenAI is working on ChatGPT the application and ecosystem. They have transitioned from model building to software engineering with RL tuning and integration of various services to solve the problems the model can't do on its own. Make it feel smart rather than be smart.
    This means that as soon as you find a problem where you step out of the guided experience you get the raw model again which fails when encountering these "gotchas".
    Edit - Here's an example where we see a very tuned RL experience in English where a whole load of context is added on how to solve the problem while the Swedish prompt for the same word fails.
    https://imgur.com/a/SlD84Ih
- ako 12 hours ago
  
  An LLM by itself is not thinking, just remembering and autocompleting. But if you add a feedback loop where it can use tools, investigate external files or processes, and then autocomplete on the results, you get to see something that is (close to) thinking. I've seen claude code debug things by adding print statements in the source and reasoning on the output, and then determining next steps. This feedback loop is what sets AI tools apart, they can all use the same LLM, but the quality of the feedback loop makes the difference.
  - DebtDeflation 6 hours ago
    
    >But if you add a feedback loop where it can use tools, investigate external files or processes, and then autocomplete on the results, you get to see something that is (close to) thinking
    It's still just information retrieval. You're just dividing it into internal information (the compressed representation of the training data) and external information (web search, API calls to systems, etc). There is a lot of hidden knowledge embedded in language and LLMs do a good job of teasing it out that resembles reasoning/thinking but really isn't.
    
    ako 4 hours ago
    
    No, it's more than information retrieval. The LLM is deciding what information needs to be retrieved to make progress and how to retrieve this information. It is making a plan and executing on it. Plan, Do, Check, Act. No human in the loop if it has the required tools and permissions.
    
    naasking 3 hours ago
    
    > LLMs do a good job of teasing it out that resembles reasoning/thinking but really isn't.
    Given the fact that "thinking" still hasn't been defined rigourously, I don't understand how people are so confident in claiming they don't think.
    
    notepad0x90 2 hours ago
    
    reasoning might be a better term to discuss as it is more specific?
    
    naasking 2 hours ago
    
    It too isn't rigourously defined. We're very much at the hand-waving "I know it when I see it" [1] stage for all of these terms.
    [1] https://en.wikipedia.org/wiki/I_know_it_when_I_see_it
  - lossyalgo 5 hours ago
    
    Just ask it how many r's are in strawberry and you will realize there isn't a lot of reasoning going on here, it's just trickery on top of token generators.
    
    Workaccount2 5 hours ago
    
    This is akin to "Show a human an optical illusion that exploits their physiology".
    LLM's be like "The dumb humans can't even see the dots"[1]
    [1]https://compote.slate.com/images/bdbaa19e-2c8f-435e-95ca-a93...
    
    lossyalgo 4 hours ago
    
    haha that's a great analogy!
    How about non-determinism (i.e. hallucinations)? Ask a human ANY question 3 times and they will give you the same answer, every time, unless you prod them or rephrase the question. Sure the answer might be wrong 3 times, but at least you have consistency. Then again, maybe that's a disadvantage for humans!
  - assimpleaspossi 8 hours ago
    
    >>you get to see something that is (close to) thinking.
    Isn't that still "not thinking"?
    
    ako 8 hours ago
    
    Depends who you ask, what their definition of thinking is.
- IanCal 12 hours ago
  
  > it's just auto-completing. It cannot reason
  Auto completion just means predicting the next thing in a sequence. This does not preclude reasoning.
  > I don't get why you would say that.
  Because I see them solve real debugging problems talking through the impact of code changes or lines all the time to find non-obvious errors with ordering and timing conditions on code they’ve never seen before.
  - notepad0x90 6 hours ago
    
    > This does not preclude reasoning.
    It does not imply it either. to claim reasoning you need evidence. it needs to reliably NOT hallucinate results for simple conversations for example (if it has basic reasoning).
    > Because I see them solve real debugging problems talking through the impact of code changes or lines all the time to find non-obvious errors with ordering and timing conditions on code they’ve never seen before.
    Programming languages and how programs work are extensively and abundantly documented, solutions to problems and how to approach them,etc.. have been documented on the internet extensively. It takes all of that data and it completes the right text by taking the most correct path way based on your input. it does not actually take your code and debug it. it is the sheer volume of data it uses and the computational resources behind it that are making it hard to wrap your head around the difference between guessing and understanding. You too can look at enough stack overflow and (poorly) guess answers for questions without understanding anything about the topic and if you guess enough you'll get some right. LLMs are just optimized to get the amount of correct responses to be high.
- xanderlewis 12 hours ago
  
  > I don't get why you would say that.
  Because it's hard to imagine the sheer volume of data it's been trained on.
  - utopiah 5 hours ago
    
    And because ALL the marketing AND UX around LLMs is precisely trying to imply that they are thinking. It's not just the challenge of grasping the ridiculous amount of resources poured in, which does including training sets, it's because actual people are PAID to convince everybody those tools are actually thinking. The prompt is a chatbox, the "..." are there like a chat with a human, the "thinking" word is used, the "reasoning" word is used, "hallucination" is used, etc.
    All marketing.
- logicchains 4 hours ago
  
  >I don't get why you would say that. it's just auto-completing. It cannot reason. It won't solve an original problem for which it has no prior context to "complete" an approximated solution with. you can give it more context and more data,but you're just helping it complete better. it does not derive an original state machine or algorithm to solve problems for which there are no obvious solutions. it instead approximates a guess (hallucination).
  I bet you can't give an example such written problem that a human can easily solve but no LLM can.
- Kichererbsen 12 hours ago
  
  Sure. But neither do you. So are you really thinking or are you just autocompleting?
  When was the last time you sat down and solved an original problem for which you had no prior context to "complete" an approximated solution with? When has that ever happened in human history? All the great invention-moment stories that come to mind seem to have exactly that going on in the background: Prior context being auto-completed in an Eureka! moment.
  - notepad0x90 4 hours ago
    
    I think (hah) you're understimating what goes on when living things (even small animals) think. We use auto-compleition for some tasks, but it is a component of what we do.
    Let's say your visual system auto-completes some pattern and detects a snake while you're walking, that part is auto-completion. You will probably react by freezing or panicing, that part is not auto-compleition, it is a deterministic algorithm. But then you process the detected object, auto-compleiting again to identify it as just a long cucumber. But again, the classification part is auto-completion. What will you do next? "Hmm, free cucumber, i can cook with it for a meal" and you pick it up. auto-completion is all over that simple decision, but you're using results of auto-completion to derive association (food), check your hunger level (not auto-completion), determine that the food is desirable and safe to eat (some auto-compleition), evalute what other options you have for food (evaluate auto-complete outputs), and then instruct your nervous system to pick it up.
    We use auto-compleition all the time as an input, we don't reason using auto-compleition in other words. You can argue that if all your input is from auto-completion (it isn't) then it makes no difference. But we have deterministic reasoning logical systems that evaluate auto-completion outputs. if your cucumber detection identified it as rotten cucumber, then decision that it is not safe to eat is not done by auto-completion but a reasoning logic that is using auto-completion output. You can approximate the level of rot but once you recognize it as rotten, you make decision based on that information. You're not approximating a decision, you're evaluating a simple logic of: if(safe()){eat();}.
    Now amp that up to solving very complex problems. what experiments will you run, what theories will you develop, what R&D is required for a solution,etc.. these too are not auto-completions. an LLM would auto complete these and might arrive at the same conclusion most of the time. but our brains are following algorithms we developed and learned over time where an LLM is just expanding on auto-completion but with a lot more data. In contrast, our brains are not trained on all the knowledge available on the public internet, we retain a tiny miniscule of that. we can arrive at similar conclusions as the LLM because we are reasoning and following algorithms matured and perfected over time.
    The big take away should be that, as powerful as LLMs are now, if they could reason like we do, they'd dominate us and become unstoppable. Because their auto-completion is many magnitudes better than ours, if they can write new and original code based on an understanding of problem solving algorithms, that would be gen ai.
    We can not just add 1 + 1 but prove that the addition operation is correct mathematically. and understand that when you add to a set one more object, the addition operation always increments. We don't approximate that, we always, every single time , increment because we are following an algorithm instead of choosing the most likely correct answer.
- naasking 3 hours ago
  
  > don't get why you would say that. it's just auto-completing.
  https://en.wikipedia.org/wiki/Predictive_coding
  > If it thought and reasoned, it would know to ignore those and focus on the variables and input at hand to produce deterministic and consistent output
  You only do this because you were trained to do this, eg. to see symmetries and translations.
- madaxe_again 12 hours ago
  
  The vast majority of human “thinking” is autocompletion.
  Any thinking that happens with words is fundamentally no different to what LLMs do, and everything you say applies to human lexical reasoning.
  One plus one equals two. Do you have a concept of one-ness, or two-ness, beyond symbolic assignment? Does a cashier possess number theory? Or are these just syntactical stochastic rules?
  I think the problem here is the definition of “thinking”.
  You can point to non-verbal models, like vision models - but again, these aren’t hugely different from how we parse non-lexical information.
  - gloosx 10 hours ago
    
    > Any thinking that happens with words is fundamentally no different from what LLMs do.
    This is such a wildly simplified and naive claim. "Thinking with words" happens inside a brain, not inside a silicon circuit with artificial neurons bolted in place. The brain is plastic, it is never the same from one moment to the next. It does not require structured input, labeled data, or predefined objectives in order to learn "thinking with words." The brain performs continuous, unsupervised learning from chaotic sensory input to do what it does. Its complexity and efficiency are orders of magnitude beyond that of LLM inference. Current models barely scratch the surface of that level of complexity and efficiency.
    > Do you have a concept of one-ness, or two-ness, beyond symbolic assignment?
    Obviously we do. The human brain's idea of "one-ness" or "two-ness" is grounded in sensory experience — seeing one object, then two, and abstracting the difference. That grounding gives meaning to the symbol, something LLMs don't have.
    
    gkbrk 9 hours ago
    
    LLMs are increasingly trained on images for multi-modal learning, so they too would have seen one object, then two.
    
    gloosx 2 hours ago
    
    They never saw any kind of object, they only saw labeled groups of pixels – basic units of a digital image, representing a single point of color on a screen or in a digital file. Object is a material thing that can be seen and touched. Pixels are not objects.
    
    gkbrk 2 hours ago
    
    Okay, goalpost has instantly moved from seeing to "seeing and touching". Once you feed in touch sensor data, where are you going to move the goalpost next?
    Models see when photons hit camera sensors, you see when photons hit your retina. Both of them are some kind of sight.
  - notepad0x90 5 hours ago
    
    We do a lot of autocompletion and LLMs overlap with that for sure. I don't know about the "vast majority" even basic operations like making sure we're breathing or have the right hormones prompted are not guesses but deterministic algorithmic ops. Things like object recognition and speech might qualify as autocompletion. But let's say you need to setup health-monitoring for an application. that's not an autocomplete operation. you must evaluate various options, have opinions on it, weigh priorities,etc.. in other words, we do autocompletion but even then the autocompletion is a basic building block or tool we use in constructing more complex decision logic.
    If you train an animal to type the right keys on a keyboard that generates a hello world program, you didn't just teach them how to code. they just memorized the right keys that lead to their reward. a human programmer understands the components of the code, the intent and expectations behind it, and can reason about how changes would affect outcomes. the animal just knows how the reward can be obtained most reliably.
- jiggawatts 8 hours ago
  
  You wrote your comment one word at a time, with the next word depending on the previous words written.
  You did not plan the entire thing, every word, ahead of time.
  LLMs do the same thing, so... how is your intelligence any different?
  - ben_w 8 hours ago
    
    A long time ago I noticed that I sometimes already had a complete thought before my inner monologue turned it into words. A few times I tried skipping the inner monologue because I'd clearly already thought the thought. Turns out the bit of my brain that creates the inner monologue from the thought, can generate a sense of annoyance that the rest of my brain can feel.
    Not that it matters, there's evidence that while LLMs output one word at a time, they've got forward-planning going on, having an idea of the end of a sentence before they get there.
    
    rcxdude 6 hours ago
    
    Indeed, and it seems like they would really struggle to output coherent text at all if there was not some kind of pre-planning involved (see how even humans struggle with it in games where you have to construct a sentance by having each person shout out one word at a time). Even GPT-2 likely had at least some kind of planning for the next few words in order to be as coherent as it was.
  - lossyalgo 5 hours ago
    
    Tell that to German-speakers, where the verb comes last, and the order of things in sentences is not anything like English, therefore requiring you to think of the entire sentence before you just spit it out. Even the numbers are backwards (twenty-two is two-and-twenty) which requires thinking.
    Furthermore, when you ask an LLM to count how many r's are in the word strawberry, it will give you a random answer, "think" about it, and give you another random answer. And I guarantee you out of 3 attempts, including reasoning, it will flip-flop between right and wrong, but unlike a human, it will be random, because, unlike humans who, when asked "how many r's are in the word strawberry" will not be able to tell you the correct answer every. fucking. time.
    edit: formatting
    
    pka 4 hours ago
    
    It seems models are pre-planning though:
    > How does Claude write rhyming poetry? Consider this ditty:
    > He saw a carrot and had to grab it,
    > His hunger was like a starving rabbit
    > To write the second line, the model had to satisfy two constraints at the same time: the need to rhyme (with "grab it"), and the need to make sense (why did he grab the carrot?). Our guess was that Claude was writing word-by-word without much forethought until the end of the line, where it would make sure to pick a word that rhymes. We therefore expected to see a circuit with parallel paths, one for ensuring the final word made sense, and one for ensuring it rhymes.
    > Instead, we found that Claude plans ahead. Before starting the second line, it began "thinking" of potential on-topic words that would rhyme with "grab it". Then, with these plans in mind, it writes a line to end with the planned word.
    [https://www.anthropic.com/research/tracing-thoughts-language...]
    
    nxor 3 hours ago
    
    The part about strawberry is just not right. That problem was solved. And I do think it's a stretch to say German speakers think of the entire sentence before speaking it.
simulator5g 13 hours ago

Having seen photocopiers so many times produce coherent, sensible, and valid chains of words on a page, I am at this point in absolutely no doubt that they are thinking.
- slightwinder 9 hours ago
  
  Photocopiers are the opposite of thinking. What goes in, goes out, no transformation or creating of new data at all. Any change is just an accident, or an artifact of the technical process.
  - HarHarVeryFunny 7 hours ago
    
    That's not actually true - try photocopying a banknote with the security pattern on it.
    
    slightwinder 5 hours ago
    
    True, modern Photocopiers are not really Photocopiers any more and just a pc with scanner and printer. Those could be described as "thinking", in the same sense as any computer is thinking. But from my understanding, the original comment was explicitly about a pure copy-device, without any "brain" involved.
    
    _boffin_ 4 hours ago
    
    That has a logic condition, no?
    
    HarHarVeryFunny 4 hours ago
    
    Not sure what you mean - there's a pattern of dots (the EURion constellation) on the banknotes of a number of countries that photocopiers recognize and will refuse to copy.
  - justinclift 8 hours ago
    
    So, if there's a flaw in its sensor and you get somewhat warped output, would you consider it thinking then?
    
    slightwinder 7 hours ago
    
    No, because it is not intentional.
- efitz 11 hours ago
  
  https://www.theregister.com/2013/08/06/xerox_copier_flaw_mea...
  - bayindirh 11 hours ago
    
    That's not a flaw. That model's creativity tuned a bit too high. It's a happy little copier which can be a little creative and unconventional with reasoning, at times.
- Zardoz84 11 hours ago
  
  I saw Dr. Abuse producing coherent, sensible and valid chains of words, running on a 386.
- throwaway-0001 9 hours ago
  
  I’ve seen so many humans bring stupid. Definitively there is nothing in the brain.
  You see how doesn’t make sense what you saying?
- seeEllArr 11 hours ago
  
  [dead]
layer8 19 hours ago

Sometimes after a night’s sleep, we wake up with an insight on a topic or a solution to a problem we encountered the day before. Did we “think” in our sleep to come up with the insight or solution? For all we know, it’s an unconscious process. Would we call it “thinking”?
The term “thinking” is rather ill-defined, too bound to how we perceive our own wakeful thinking.
When conversing with LLMs, I never get the feeling that they have a solid grasp on the conversation. When you dig into topics, there is always a little too much vagueness, a slight but clear lack of coherence, continuity and awareness, a prevalence of cookie-cutter verbiage. It feels like a mind that isn’t fully “there” — and maybe not at all.
I would agree that LLMs reason (well, the reasoning models). But “thinking”? I don’t know. There is something missing.
- AnIrishDuck 12 hours ago
  
  > Sometimes after a night’s sleep, we wake up with an insight on a topic or a solution to a problem we encountered the day before.
  The current crop of models do not "sleep" in any way. The associated limitations on long term task adaptation are obvious barriers to their general utility.
  > When conversing with LLMs, I never get the feeling that they have a solid grasp on the conversation. When you dig into topics, there is always a little too much vagueness, a slight but clear lack of coherence, continuity and awareness, a prevalence of cookie-cutter verbiage. It feels like a mind that isn’t fully “there” — and maybe not at all.
  One of the key functions of REM sleep seems to be the ability to generalize concepts and make connections between "distant" ideas in latent space [1].
  I would argue that the current crop of LLMs are overfit on recall ability, particularly on their training corpus. The inherent trade-off is that they are underfit on "conceptual" intelligence. The ability to make connections between these ideas.
  As a result, you often get "thinking shaped objects", to paraphrase Janelle Shane [2]. It does feel like the primordial ooze of intelligence, but it is clear we still have several transformer-shaped breakthroughs before actual (human comparable) intelligence.
  1. https://en.wikipedia.org/wiki/Why_We_Sleep 2. https://www.aiweirdness.com/
- popalchemist 11 hours ago
  
  There is simply put no ongoing process and no feedback loop. The model does not learn. The cognition ends when the inference cycle ends. It's not thinking, it just produces output that looks similar to the output of thinking. But the process by which it does that is wholly unreleated.
  - AlphaAndOmega0 10 hours ago
    
    Humans with certain amnestic syndromes are incapable of learning. That doesn't make them unintelligent or incapable of thought.
  - ako 10 hours ago
    
    Most AI tooling is shipped with a feedback loop around the LLM. The quality of Claude Code for example lies in the feedback loop it provides on your code. Maybe the LLM itself isn't thinking, but the Agent which ships an LLM plus feedback loop definitely shows thinking qualities.
    Just now in an debugging session with claude code:
    * let me read this file... * let me read this file... * I think there's a caching issue with the model after dropping the module. Let me check if there's a save or reload needed after DROP MODULE. First, let me verify something: * creates a bash/javascript script to verify its assumption * runs the script (after review and approval) * Aha! I found the problem! Look at the output...
    How is this not thinking?
    
    popalchemist 10 hours ago
    
    without getting into theory of mind it's a bit difficult to elaborate, and I don't have the time or the will for that. But the short version is that thinking is interconnected with BEING as well as will, and the Agent has neither, in a philosophically formal sense. The agent is deterministically bound. So it is a fancy Rube Goldberg machine that outputs letters in a way that creates the impression of thought, but it is not thought, in the same way that some birds can mimic human speech without even the slightest hint as to the words' or sentences' meaning, underlying grammar, connotations, subtext, context, intended use, likely effect, etc. Is speech speech if the speaker has no concept whatsoever of said speech's content, and can not use it to actualize itself? I'd say no. It's mimicry, but not speech. So that means speech is something more than just its outward aspect - the words. It is the relation of something invisible, some inner experience known only to the speaker, VIA the words.
    Whereas a gorilla who learns sign language to communicate and use that communication to achieve aims which have direct correlation with its sense of self - that's thought in the Cogito, Ergo Sum sense of the word.
    Thought as commonly concieved by the layman is a sort of isolated phenomenon that is mechanical in nature and can be judged by its outward effects; whereas in the philosophical tradition defining thought is known to be one of the hard questions for its mysterious qualia of being interconnected with will and being as described above.
    Guess I gave you the long answer. (though, really, it could be much longer than this.) The Turing Test touches on this distinction between the appearance of thought and actual thought.
    The question goes all the way down to metaphysics; some (such as myself) would say that one must be able to define awareness (what some call consciousness - though I think that term is too loaded) before you can define thought. In fact that is at the heart of the western philosophical tradition; and the jury consensus remains elusive after all these thousands of years.
    
    TheOtherHobbes 8 hours ago
    
    The obvious counterargument is that a calculator doesn't experience one-ness, but it still does arithmetic better than most humans.
    Most people would accept that being able to work out 686799 x 849367 is a form of thinking, albeit an extremely limited one.
    First flight simulators, then chess computers, then go computers, then LLMs are the same principle extended to much higher levels of applicability and complexity.
    Thinking in itself doesn't require mysterious qualia. It doesn't require self-awareness. It only requires a successful mapping between an input domain and an output domain. And it can be extended with meta-thinking where a process can make decisions and explore possible solutions in a bounded space - starting with if statements, ending (currently) with agentic feedback loops.
    Sentience and self-awareness are completely different problems.
    In fact it's likely with LLMs that we have off-loaded some of our cognitive techniques to external hardware. With writing, we off-loaded memory, with computing we off-loaded basic algorithmic operations, and now with LLMs we have off-loaded some basic elements of synthetic exploratory intelligence.
    These machines are clearly useful, but so far the only reason they're useful is because they do the symbol crunching, we supply the meaning.
    From that point of view, nothing has changed. A calculator doesn't know the meaning of addition, an LLM doesn't need to know the meaning of "You're perfectly right." As long as they juggle symbols in ways we can bring meaning to - the core definition of machine thinking - they're still "thinking machines."
    It's possible - I suspect likely - they're only three steps away from mimicking sentience. What's needed is a long-term memory, dynamic training so the model is constantly updated and self-corrected in real time, and inputs from a wide range of physical sensors.
    At some point fairly soon robotics and LLMs will converge, and then things will get interesting.
    Whether or not they'll have human-like qualia will remain an unknowable problem. They'll behave and "reason" as if they do, and we'll have to decide how to handle that. (Although more likely they'll decide that for us.)
    
    ako 8 hours ago
    
    So if you don’t have a long term memory, you’re not capable of sentience? Like the movie memento, where the main character needs to write down everything to remind him later because he’s not able to remember anything. This is pretty much like llms using markdown documents to remember things.
    
    Workaccount2 4 hours ago
    
    "To escape the paradox, we invoke what we call the “Homunculus Defense”: inside every human is a tiny non-stochastic homunculus that provides true understanding. This homunculus is definitionally not a stochastic parrot because:
    1. It has subjective experience (unprovable but assumed)
    2. It possesses free will (compatibilist definitions need not apply)
    3. It has attended at least one philosophy seminar"[1]
    [1]https://ai.vixra.org/pdf/2506.0065v1.pdf
    
    ako 10 hours ago
    
    For practical every day uses, does it really matter if it is "real thinking" or just really good "artificial thinking" with the same results? The machine can use artificial thinking to reach desired goals and outcomes, so for me it's the kind of thinking i would want from a machine.
    
    popalchemist 10 hours ago
    
    For practical purposes, abstractions are useful, so, no, it doesn't matter.
    But the post we are responding to is directly posing the philosophical question of whether the activity of LLM agents constitutes thought.
    
    Earw0rm 7 hours ago
    
    It seems pretty clear to me though that being good at intellectual tasks / the sort of usefulness we ascribe to LLMs doesn't strongly correlate with awareness.
    Even just within humans - many of the least intellectually capable humans seem to have a richer supply of the traits associated with awareness/being than some of the allegedly highest-functioning.
    On average you're far more likely to get a sincere hug from someone with Down's syndrome than from a multi-millionaire.
    But I'm more interested in this when it comes to the animal kingdom, because while ChatGPT is certainly more useful than my cat, I'm also pretty certain that it's a lot less aware. Meaningful awareness - feelings - seems to be an evolutionary adaptation possessed by k-strategy reproducing vertebrates. Having a small number of kids and being biologically wired to care for them has huge implications for your motivation as an animal, and it's reasonable to think that a lot of our higher emotions are built on hardware originally evolved for that purpose.
    (Albeit the evolutionary origins of that are somewhat murky - to what extent mammals/birds reuse capabilities that were developed by a much earlier common ancestor, or whether it's entirely parallel evolution, isn't known afaik - but birds seem to exhibit a similar set of emotional states to mammals, that much is true).
  - Festro 10 hours ago
    
    You're moving the goalposts and contradicting yourself with language games.
    Something doesn't need to learn to think. I think all the time without learning.
    There's also an argument for machines already starting to crack learning with literal reinforcement training and feedback loops.
    Your language game was when you said the 'cognition ends...', as cognition is just a synonym for thinking. "The thinking ends when the inference cycle ends. It's not thinking'" becomes a clear contradiction.
    As for "the process by which it does that is wholly unrelated", buddy it's modelled on human neuron behaviour. That's how we've had this generative AI breakthrough. We've replicated human mental cognition as closely as we can with current technology and the output bears striking resemblance to our own generative capabilities (thoughts).
    Happy to admit it's not identical, but it's damn well inside the definition of thinking, may also cover learning. It may be better to take a second look at human thinking and wonder if it's as cryptic and deep as we thought ten, twenty years ago.
- creer 14 hours ago
  
  Interesting, you think the associations your brain comes up with during sleep are NOT thinking?
  - keiferski 12 hours ago
    
    I assume most people agree with that framing. “Thinking” generally implies a level of consciousness or self-awareness.
    “Let me think about this.” “I have to think on it.”
    My brain regulates all sorts of processes unconsciously, like breathing, for example. I don’t treat those as “thinking,” so I don’t know why other unconscious brain activity would be either.
    
    creer 9 hours ago
    
    I don't know that most people think about that at all. For me, I have learned not to dismiss the remuninating, reordering, reframing that my brain performs without me deliberately reasoning line by line about it. Limiting my brain to deliberate line by line reasoning would be very limiting and costly.
    "Thinking" to me is very much NOT just conscious reasoning. So much of what I think is not done consciously.
    Indeed "let me think about it" is often simply giving my brain time to "sit on it", for another expression - only after which will I have enough mind time on the various alternatives for a worthwhile conscious decision.
  - madaxe_again 12 hours ago
    
    I’d say it’s more like annealing. Sort, filter, compress, integrate.
    Updates your models for the next morning, which is why the answer is there when it wasn’t before.
- bithead 15 hours ago
  
  Do LLMs ever ask for you to clarify something you said in a way a person who doesn't quite understand what you said will do?
  - Workaccount2 4 hours ago
    
    Sometimes I think people leveraging criticisms of LLMs used ChatGPT 3 years ago and haven't touched one since, except for asking how many r's are in strawberry a year and a half ago.
  - savolai 13 hours ago
    
    Yeah, as someone who has gained a lot of interaction skills by playing with the constructivist learning ennvironment called the enneagram, I can attest that it much resembles behaviour characteristic of certain enneatypes.
  - brabel 12 hours ago
    
    What now, two minutes using one and you are going to get that!
  - willmarch 14 hours ago
    
    Yes, often
- lordnacho 11 hours ago
  
  > When conversing with LLMs, I never get the feeling that they have a solid grasp on the conversation. When you dig into topics, there is always a little too much vagueness, a slight but clear lack of coherence, continuity and awareness, a prevalence of cookie-cutter verbiage. It feels like a mind that isn’t fully “there” — and maybe not at all.
  Much like speaking to a less experienced colleague, no?
  They say things that contain the right ideas, but arrange it unconvincingly. Still useful to have though.
- madaxe_again 12 hours ago
  
  Perhaps this is an artefact of instantiation - when you talk with an LLM, the responding instance is just that - it comes into being, inhales your entire chat history, and then continues like the last chap, finishes its response, and dies.
  The continuity is currently an illusion.
- petralithic 14 hours ago
  
  > Would we call it “thinking”?
  Yes I would.
ben_w 8 hours ago

> Having seen LLMs so many times produce coherent, sensible and valid chains of reasoning to diagnose issues and bugs in software I work on, I am at this point in absolutely no doubt that they are thinking.
While I'm not willing to rule *out* the idea that they're "thinking" (nor "conscious" etc.), the obvious counter-argument here is all the records we have of humans doing thinking, where the records themselves are not doing the thinking that went into creating those records.
And I'm saying this as someone whose cached response to "it's just matrix multiplication it can't think/be conscious/be intelligent" is that, so far as we can measure all of reality, everything in the universe including ourselves can be expressed as matrix multiplication.
Falsification, not verification. What would be measurably different if the null hypothesis was wrong?
- chpatrick 8 hours ago
  
  I've definitely had AIs thinking and producing good answers about specific things that have definitely not been asked before on the internet. I think the stochastic parrot argument is well and truly dead by now.
  - Earw0rm 7 hours ago
    
    I've also experienced this, to an extent, but on qualitative topics the goodness of an answer - beyond basic requirements like being parseable and then plausible - is difficult to evaluate.
    They can certainly produce good-sounding answers, but as to the goodness of the advice they contain, YMMV.
    
    chpatrick 5 hours ago
    
    I've certainly got useful and verifiable answers. If you're not sure about something you can always ask it to justify it and then see if the arguments make sense.
  - hitarpetar 5 hours ago
    
    how do you definitely know that?
    
    stinos 3 hours ago
    
    Also, does it matter?
    The point being made here is about the data LLMs have been trained with. Sure that contains questions&answers but obviously not all of it is in that form. Just like an encyclopedie contains answers without the questions. So imo specifying this as 'no-one asked this before' is irrelevant.
    More interesting: did OP get a sensible answer to a question about data which definitely was not in the training set? (and indeed, how was this 'definitely' established'). Not that if the answer is 'yes' that'll prove 'thinking', as opposed to calling it e.g. advanced autocompletion, but it's a much better starting point.
    
    chpatrick 3 hours ago
    
    Because I gave them a unique problem I had and it came up with an answer it definitely didn't see in the training data.
    Specifically I wanted to know how I could interface two electronic components, one of which is niche, recent, handmade and doesn't have any public documentation so there's no way it could have known about it before.
    
    stinos 3 hours ago
    
    one of which is niche, recent, handmade and doesn't have any public documentation
    I still see 2 possibilities: you asked it something similar enough that it came up with a fairly standard answer which just happened to be correct, or you gave it enough info.
    - for example you created a new line of MCUs called FrobnicatorV2, and asked is 'how do I connect a power supply X to FrobnicatorV2' and it gave an answer like 'connect red wire to VCC and black to GND'. That's not exactly special.
    - or, you did desribe that component in some way. And you did do that using standard electronics lingo so essentially in terms of other existing components which it definitely did know (unless you invented something completely new not using any currently know physics). As such it's irrelevant that your particular new component wasn't known because you gave away the answer by describing it? E.g. you aksed it 'how do I connect a power supply X to an MCU with power pins Y and Z'. Again nothing special.
    
    chpatrick 2 hours ago
    
    If a human uses their general knowledge of electronics to answer a specific question they haven't seen before that's obviously thinking. I don't see why LLMs are held to a different standard. It's obviously not repeating an existing answer verbatim because that doesn't exist in my case.
    You're saying it's nothing "special" but we're not discussing whether it's special, but whether it can be considered thinking.
marcus_holmes 14 hours ago

Yes, I've seen the same things.
But; they don't learn. You can add stuff to their context, but they never get better at doing things, don't really understand feedback. An LLM given a task a thousand times will produce similar results a thousand times; it won't get better at it, or even quicker at it.
And you can't ask them to explain their thinking. If they are thinking, and I agree they might, they don't have any awareness of that process (like we do).
I think if we crack both of those then we'd be a lot closer to something I can recognise as actually thinking.
- theptip 14 hours ago
  
  > But; they don't learn
  If we took your brain and perfectly digitized it on read-only hardware, would you expect to still “think”?
  Do amnesiacs who are incapable of laying down long-term memories not think?
  I personally believe that memory formation and learning are one of the biggest cruces for general intelligence, but I can easily imagine thinking occurring without memory. (Yes, this is potentially ethically very worrying.)
  - lz400 13 hours ago
    
    >If we took your brain and perfectly digitized it on read-only hardware, would you expect to still “think”?
    it wouldn't work probably, brains constantly alter themselves by forming new connections. Learning is inseparable from our intelligence.
    
    WalterSear 12 hours ago
    
    Our intelligence, yes. But that doesn't establish it as essential for thought.
    
    lz400 12 hours ago
    
    I mean, _we_ probably can't think with our wetware on a read-only substrate. It doesn't establish it as essential, just that the only sure example in nature of thought doesn't work that way.
    
    theptip 3 hours ago
    
    Do you have any particular brain systems in mind that are essential for consciousness and also require mutable state?
  - zeroonetwothree 13 hours ago
    
    > If we took your brain and perfectly digitized it on read-only hardware, would you expect to still “think”?
    Perhaps this is already known, but I would think there is a high chance that our brains require "write access" to function. That is, the very process of neural activity inherently makes modifications to the underlying structure.
    
    theptip 3 hours ago
    
    > a high chance that our brains require "write access" to function
    There are multiple learning mechanisms that happen on different time-frames, eg neural plasticity, hippocampus are both longer-term processes for memory consolidation. Whereas the content of “RAM” might be better modeled as a set of fast dynamic weights representing ions and neurotransmitter concentrations.
    My hunch is that you could model the latter fast weights in volatile memory, but I wouldn’t count these as “modifying the structure”.
    Do you have any particular systems in mind where you have reason to believe that permanent modification is required for brains to function?
    
    xwolfi 12 hours ago
    
    I wonder why we need to sleep so much though
    
    theptip 3 hours ago
    
    Self-play for memory/skill consolidation, plus flushing out metabolites which can’t be cleaned while consciousness is online?
    
    throwaway-0001 9 hours ago
    
    Rebalancing weights?
- trenchpilgrim 11 hours ago
  
  > You can add stuff to their context, but they never get better at doing things, don't really understand feedback.
  I was using Claude Code today and it was absolutely capable of taking feedback to change behavior?
- jatora 14 hours ago
  
  This is just wrong though. They absolutely learn in-context in a single conversation within context limits. And they absolutely can explain their thinking; companies just block them from doing it.
geon 19 hours ago

Having seen LLMs so many times produce incoherent, nonsensical and invalid chains of reasoning...
LLMs are little more than RNGs. They are the tea leaves and you read whatever you want into them.
- rcxdude 6 hours ago
  
  They are clearly getting to useful and meaningful results with at a rate significantly better than chance (for example, the fact that ChatGPT can play chess well even though it sometimes tries to make illegal moves shows that there is a lot more happening there than just picking moves uniformly at random). Demanding perfection here seems to be odd given that humans also can make bizarre errors in reasoning (of course, generally at a lower rate and in a distribution of kinds of errors we are more used to dealing with).
  - matthewkayin 3 hours ago
    
    The fact that a model trained on the internet, on which the correct rules of chess are written, is unable to determine what is and is not a legal move, seems like a sign that these models are not reasoning about the questions asked of them. They are just giving responses that look like (and often are) correct chess moves.
    
    rcxdude 3 hours ago
    
    It's a sign that they are 'reasoning' imperfectly. If they were just giving responses that 'looked like' chess moves, they would be very bad at playing chess.
    (And I would hazard a guess that they are a primarily learning chess from the many games that are posted, as opposed to working things out from the rules. Indeed, if you make up a game and tell chatGPT the rules, it tends to be even worse at following them, let alone figuring out optimal play. But again, it will do so significantly better than random chance, so it's doing something with the information you give it, even if it's not doing so very well. I think it's reasonable to call this thinking, or reasoning, but this mostly becomes an argument of semantics. either way they do it significantly better than random chance but still not tremendously well. If your expectation is that they cannot work with anything novel then you're going to be continually surprised, but if your expectation is that they're as good as a human that has 'learned' from all the material its been given, especially material that's in-context and not in the training data, then you're also going to be disappointed.)
- bongodongobob 14 hours ago
  
  Ridiculous. I use it daily and get meaningful, quality results. Learn to use the tools.
  - 9rx 11 hours ago
    
    > Learn to use the tools.
    Thing is, you wouldn't need to learn to use the tools if the tool was able to think. A thinking entity is able to adapt to other parties who lack learnings. This confirms that LLMs are little more than fancy RNGs.
    > I use it daily and get meaningful, quality results.
    That's what the tea leaf readers say too, funnily enough.
    
    bongodongobob 11 hours ago
    
    Yeah, this is what LLMs might say too, funny enough.
    
    9rx 11 hours ago
    
    The probability is in its favour.
  - aydyn 14 hours ago
    
    Learn to work on interesting problems? If the problem you are working on is novel and hard, the AI will stumble.
    Generalizing your experience to everyone else's betrays a lack of imagination.
    
    khafra 12 hours ago
    
    > Generalizing your experience to everyone else's betrays a lack of imagination.
    One guy is generalizing from "they don't work for me" to "they don't work for anyone."
    The other one is saying "they do work for me, therefore they do work for some people."
    Note that the second of these is a logically valid generalization. Note also that it agrees with folks such as Tim Gowers, who work on novel and hard problems.
    
    dns_snek 11 hours ago
    
    No, that's decidedly not what is happening here.
    One is saying "I've seen an LLM spectacularly fail at basic reasoning enough times to know that LLMs don't have a general ability to think" (but they can sometimes reproduce the appearance of doing so).
    The other is trying to generalize "I've seen LLMs produce convincing thought processes therefore LLMs have the general ability to think" (and not just occasionally reproduce the appearance of doing so).
    And indeed, only one of these is a valid generalization.
    
    MrScruff 11 hours ago
    
    When we say "think" in this context, do we just mean generalize? LLMs clearly generalize (you can give one a problem that is not exactly in it's training data and it can solve it), but perhaps not to the extent a human can. But then we're talking about degrees. If it was able to generalize at a higher level of abstraction maybe more people would regard it as "thinking".
    
    dns_snek 10 hours ago
    
    I meant it in the same way the previous commenter did:
    > Having seen LLMs so many times produce incoherent, nonsensical and invalid chains of reasoning... LLMs are little more than RNGs. They are the tea leaves and you read whatever you want into them.
    Of course LLMs are capable of generating solutions that aren't in their training data sets but they don't arrive at those solutions through any sort of rigorous reasoning. This means that while their solutions can be impressive at times they're not reliable, they go down wrong paths that they can never get out of and they become less reliable the more autonomy they're given.
    
    dagss 5 hours ago
    
    It's rather seldom that humans arrive at solutions through rigorous reasoning. The word "think" doesn't mean "rigorous reasoning" in every day language. I'm sure 99% of human decisions are pattern matching on past experience.
    Even when mathematicians do in fact do rigorous reasoning, they use years to "train" first, to get experiences to pattern match from.
    
    Workaccount2 4 hours ago
    
    I have been on a crusade now for about a year to get people to share chats where SOTA LLMs have failed spectacularly to produce coherent, good information. Anything with Heavy hallucinations and outright bad information.
    So far, all I have gotten is data that is outside the knowledge cutoff (this is by far the most common) and technicality wrong information (Hawsmer House instead of Hosmer House) kind of fails.
    I thought maybe I hit on something with the recent BBC study about not trusting LLM output, but they used 2nd shelf/old mid-tier models to do their tests. Top LLMs correctly answered their test prompts.
    I'm still holding out for one of those totally off the rails Google AI overviews hallucinations showing up in a top shelf model.
    
    MrScruff 10 hours ago
    
    Sure, and I’ve seen the same. But I’ve also seen the amount to which they do that decrease rapidly over time, so if that trend continues would your opinion change?
    I don’t think there’s any point in comparing to human intelligence when assessing machine intelligence, there’s zero reason to think it would have similar qualities. It’s quite clear for the foreseeable future it will be far below human intelligence in many areas, while already exceeding humans in some areas that we regard as signs of intelligence.
    
    sdenton4 9 hours ago
    
    s/LLM/human/
    
    dns_snek 8 hours ago
    
    Clever. Yes, humans can be terrible at reasoning too, but in any half decent technical workplace it's so rare for people to fail to apply logic as often and in ways that are as frustrating to deal with as LLMs. And if they are then they should be fired.
    I can't say I remember a single coworker that would fit this description though many were frustrating to deal with for other reasons, of course.
    
    cindyllm 5 hours ago
    
    [dead]
    
    dimator 13 hours ago
    
    This is my experience. For rote generation, it's great, saves me from typing out the same boilerplate unit test bootstrap, or refactoring something that exists, etc.
    Any time I try to get a novel insight, it flails wildly, and nothing of value comes out. And yes, I am prompting incrementally and building up slowly.
    
    player1234 11 hours ago
    
    [flagged]
    
    tomhow 9 hours ago
    
    We've banned this account for repeated abusive comments to fellow community members. Normally we give warnings, but when it's as extreme and repetitive as we can see here, an instant ban is appropriate. If you don't want to be banned, you can email us at hn@ycombinator.com and demonstrate a sincere intent to use HN as intended in future.
    
    lordnacho 11 hours ago
    
    Even people who do actual hard work need a lot of ordinary scaffolding done for them.
    A secretary who works for an inventor is still thinking.
    
    tmhn2 10 hours ago
    
    Research mathematicians have been finding the tools useful [1][2]. I think those problems are interesting, novel, and hard. The AI might stumble sometimes, but it also produces meaningful, quality results sometimes. For experts working on interesting problems, that is enough to be useful.
    [1] https://mathstodon.xyz/@tao/115420236285085121 [2] https://xcancel.com/wtgowers/status/1984340182351634571
    
    dns_snek 9 hours ago
    
    That's a motte and bailey fallacy. Nobody said that they aren't useful, the argument is that they can't reason [1]. The world is full of useful tools that can't reason or think in any capacity.
    [1] That does not mean that they can never produce texts which describes a valid reasoning process, it means that they can't do so reliably. Sometimes their output can be genius and other times you're left questioning if they even have the reasoning skills of a 1st grader.
    
    chimprich 7 hours ago
    
    I don't agree that LLMs can't reason reliably. If you give them a simple reasoning question, they can generally make a decent attempt at coming up with a solution. Complete howlers are rare from cutting-edge models. (If you disagree, give an example!)
    Humans sometimes make mistakes in reasoning, too; sometimes they come up with conclusions that leave me completely bewildered (like somehow reasoning that the Earth is flat).
    I think we can all agree that humans are significantly better and more consistently good at reasoning than even the best LLM models, but the argument that LLMs cannot reliably reason doesn't seem to match the evidence.
    
    the-mitr 11 hours ago
    
    Even most humans will stumble on hard problems, that is the reason they are hard in the first place
    
    XenophileJKO 12 hours ago
    
    I'm genuinely curious what you work on that is so "novel" that an LLM doesn't work well on?
    I feel like so little is TRUELY novel. Almost everything is built on older concepts and to some degree expertise can be applied or repurposed.
    
    EagnaIonat 12 hours ago
    
    Anything relatively new in a technology LLMs struggle with, especially if the documentation is lacking.
    Godot for example in ChatGPT.
    It may no longer still be the case, but the documentation for GoDot was lacking and often samples written by others didn't have a version number associated with it. So samples it would suggest would never work, and even when you told it the version number it failed to generate workable code.
    The other stuff I've noticed is custom systems. One I work with is a variation of Java, but LLMs were treating it as javascript. I had to create a LoRA just to get the model from not trying to write javascript answer. Even then it could never work, because it had never been trained on real world examples.
    
    geon 10 hours ago
    
    It doesn't have to be very novel at all. Anything but the most basic TODO-list app.
    
    aydyn 11 hours ago
    
    Literally anything in the science domain. Adding features to your software app is indeed usually not novel.
    
    bongodongobob 11 hours ago
    
    That's where the bar is now?
    
    aydyn 11 hours ago
    
    huh?
    
    bongodongobob 11 hours ago
    
    Dude. We don't all work for NASA. Most day to day problems aren't novel. Most jobs aren't novel. Most jobs can't keep a variety of sometimes useful experts on hand. I do my job and I go home and do my hobbies. Anything I can use at work to keep friction down and productivity up is extremely valuable.
    Example prompt (paraphrasing and dumbed down, but not a ton): Some users across the country can't get to some fileshares. I know networking, but I'm not on the networking team so I don't have full access to switch, router, and firewall logs/configurations. It looks kind of random, but there must be a root cause, let's find it.
    I can't use Python(security team says so) and I don't have access to a Linux box that's joined to the domain and has access the shares.
    We are on a Windows domain controller. Write me a PowerShell 5.1 compatible script to be run remotely on devices. Use AD Sites and Services to find groups of random workstations and users at each office and tries to connect to all shares at each other site. Show me progress in the terminal and output an Excel file and Dot file that clearly illustrates successful and failed connections.
    ---
    And it works. Ok, I can see the issue is from certain sites that use x AND y VPN ipsec tunnels to get to particular cloud resources. I give this info to networking and they fix it right away. Problem resolved in less than an hour.
    First of all, a couple years ago, I wouldn't have been able to justify writing something like this while an outage is occuring. Could I do it myself? Sure, but I'm going to have to look up the specifics of syntax and certain commands and modules. I don't write PowerShell for a living or fun, but I do need to use it. I am familiar and know how to write it. But I sure as fuck couldn't sit down and spend an hour or two screwing around working on building a goddamn Dot file generator. Yes, years ago I had a whole pile of little utility modules I could use. But that's a far cry from what I can do now to fit the exact situation < 15 minutes while I do other things like pick up the phone, message coworkers, etc.
    Secondly, rather than building little custom tools to hook together as I need, I can just ask for the whole thing. I don't need to save any of that stuff anymore and re-figure out what CheckADFSConns(v2).PS1 that I wrote 8 months ago does and how to use it. "Oh, that's not the one, what the did I name that? Where did I put it?"
    I work in an environment that is decades old, the company is over 100 years old, I didn't build any of it myself, is not a tech company, and has tons of tech debt and weird shit. AI is insanely useful. For any given problem, there are dozens of different rabbit holes I could go down because of decades of complete system overhaul changes. Today, I can toss a variety of logs at AI and if nothing else, get a sense of direction of why a handful of PCs are rejecting some web certificates. (Combination of a new security policy and their times mismatching the domain controller, because it was new, and NTP wasn't configured properly. I wasn't even looking for timestamps, but it noticed event offsets and pointed it out).
    I feel like this community isn't very familiar with what that's like. We aren't all working on self driving cars or whatever seems hard at a brand new company with new everything and no budget. Some of us need to keep the systems running that help people to make actual things. These environments are far from pristine and are held together by underpaid and underappreciated normies through sheer willpower.
    Is this kind of work breaking technical frontiers? No. But it's complicated, difficult, and unpredictable. Is it novel? The problems are, sometimes.
    Generalizing your experience to everyone else's betrays your lack of self-awareness, sir.
  - bopbopbop7 14 hours ago
    
    They are only meaningful and quality if you don’t know what you’re doing. But please do show some of this meaningful and quality work so I can be proven wrong.
    
    hattmall 13 hours ago
    
    Yes, please this is literally what I want to see. I have yet to see an example where an LLM did anything that was sufficiently difficult. Not saying they can't be useful, but for anything past the basics they are really all over the place. And if we were paying anywhere near the true costs it wouldn't be even worth trying.
    
    bongodongobob 11 hours ago
    
    I'm not spending the time to de-anyonymize and exfiltrate my work for you. I exert less energy, spend less time to do my work, and get paid the same. I'd encourage you to do the same.
    
    seeEllArr 11 hours ago
    
    [dead]
    
    bongodongobob 11 hours ago
    
    See my comment to parent. One example of many. You can say "Oh, well, it just sounds like your company needs better tools and processes, you don't really need AI for any of that. You should just invest in a tool for this and monitor that and have managment prioritize..."
    Yeah, I know, yet here we are and it saves me boatloads of time.
    
    bopbopbop7 5 hours ago
    
    I never said any of that, look at my comment again. Or is the LLM replying for you also?
    
    bongodongobob 4 hours ago
    
    You said they are only useful if you don't know what you're doing. Or do you have a goldfish memory?
  - bryan_w 12 hours ago
    
    That's a skill issue on your end
techblueberry 7 hours ago

Isn’t anthropomorphizing LLMs rather than understanding their unique presence in the world a “ lack of imagination and flexibility of thought”? It’s not that I can’t imagine applying the concept “thinking” to the output on the screen, I just don’t think it’s an accurate description.
- heresie-dabord 6 hours ago
  
  Yes, it's an example of domain-specific thinking. "The tool helps me write code, and my job is hard so I believe this tool is a genius!"
  The Roomba vacuumed the room. Maybe it vacuumed the whole apartment. This is good and useful. Let us not diminish the value of the tool. But it's a tool.
  The tool may have other features, such as being self-documenting/self-announcing. Maybe it will frighten the cats less. This is also good and useful. But it's a tool.
  Humans are credulous. A tool is not a human. Meaningful thinking and ideation is not just "a series of steps" that I will declaim as I go merrily thinking. There is not just a vast training set ("Reality"), but also our complex adaptability that enables us to test our hypotheses.
  We should consider what it is in human ideation that leads people to claim that a Roomba, a chess programme, Weizenbaum's Eliza script, the IBM's Jeopardy system Watson, or an LLM trained on human-vetted data is thinking.
  Train such a system on the erroneous statements of a madman and suddenly the Roomba, Eliza, IBM Watson (and these other systems) lose our confidence.
  As it is today, the confidence we have in these systems is very conditional. It doesn't matter terribly if code is wrong... until it does.
  Computers are not humans. Computers can do things that humans cannot do. Computers can do these things fast and consistently. But fundamentally, algorithms are tools.
ph4rsikal 5 hours ago

It might appear so, but then you could validate it with a simple test. If the LLM would play a 4x4 Tic Tac Toe game, would the agent select the winning move 100% of all time or block a losing move 100% of the time? If these systems were capable of proper reasoning, then they would find the right choice in these obvious but constantly changing scenarios without being specifically trained for it.
[1] https://jdsemrau.substack.com/p/nemotron-vs-qwen-game-theory...
josefx 12 hours ago

Counterpoint: The seahorse emoji. The output repeats the same simple pattern of giving a bad result and correcting it with another bad result until it runs out of attempts. There is no reasoning, no diagnosis, just the same error over and over again within a single session.
- becquerel 12 hours ago
  
  A system having terminal failure modes doesn't inherently negate the rest of the system. Human intelligences fall prey to plenty of similarly bad behaviours like addiction.
  - josefx 11 hours ago
    
    I never met an addicted person that could be reduced to a simple while(true) print("fail") loop.
- throwaway-0001 9 hours ago
  
  You never had that coleague that says yes to everything and can’t get anything done? Same thing as seahorse.
camgunz an hour ago

Then the only thing I have to ask you is: what do you think this means in terms of how we treat LLMs? If they think, that is, they have cognition (which of course means they're self aware and sentient, how can you think and refer to yourself and not be these things), that puts them in a very exclusive club. What rights do you think we should be affording LLMs?
noiv 12 hours ago

Different PoV: You have a local bug and ask the digital hive mind for a solution, but someone already solved the issue and their solution was incorporated... LLMs are just very effficient at compressing billions of solutions into a few GB.
Try to ask something no one ever came up with a solution so far.
- brabel 12 hours ago
  
  This argument comes up often but can be easily dismissed. Make up a language and explain it to the LLM like you would to a person. Tell it to only use that language now to communicate. Even earlier AI was really good at this. You will probably move the goal posts and say that this is just pattern recognition, but it still fits nicely within your request for something that no one ever came up with.
  - emodendroket 11 hours ago
    
    I haven't tried in a while but at least previously you could completely flummox Gemini by asking it to come up with some plausible English words with no real known meaning; it just kept giving me rare and funny-sounding actual words and then eventually told me the task is impossible.
    
    tobyjsullivan 11 hours ago
    
    ChatGPT didn't have any issue when I recently asked something very similar.
    https://chatgpt.com/share/6909b7d2-20bc-8011-95b6-8a36f332ac...
  - noiv 9 hours ago
    
    Ask ChatGPT about ConLang. It knows. Inventing languages was solved a hundred years ago with Esperanto.
  - hitarpetar 5 hours ago
    
    > but it still fits nicely within your request for something that no one ever came up with
    YOU came up with it dawg
didibus 16 hours ago

I guess it depends if you definite thinking thinking as chaining coherent reasoning sentences together 90-some% of the time.
But if you define thinking as the mechanism and process we mentally undergo and follow mentally... I don't think we have any clue if that's the same. Do we also just vector-map attention tokens and predict the next with a softmax? I doubt, and I don't think we have any proof that we do.
- aydyn 11 hours ago
  
  We do know at the biochemical level how neurons work, and it isnt anything like huge matmuls.
jimbohn 4 hours ago

It's reinforcement learning applied to text, at a huge scale. So I'd still say that they are not thinking, but they are still useful. The question of the century IMO is if RL can magically solve all our issues when scaled enough.
hyperbovine 3 hours ago

Code gen is the absolute best case scenario for LLMs though: highly structured language, loads of training data, the ability to automatically error check the responses, etc. If they could mimic reasoning anywhere it would be on this problem.
I'm still not convinced they're thinking though because they faceplant on all sorts of other things that should be easy for something that is able to think.
ryanackley 8 hours ago

I think we can call it "thinking" but it's dangerous to anthropomorphize LLMs. The media and AI companies have an agenda when doing so.
conartist6 19 hours ago

Yeah but if I assign it a long job to process I would also say that an x86 CPU is "thinking" about a problem for me.
What we really mean in both cases is "computing," no?
raincole 20 hours ago

I'd represent the same idea but in a different way:
I don't know what the exact definition of "thinking" is. But if a definition of thinking rejects the possibility of that current LLMs think, I'd consider that definition useless.
- didibus 16 hours ago
  
  Why would it be useless?
  Generally thinking has been used to describe the process human follow in their brains when problem solving.
  If the Palms do not follow that process, they are not thinking.
  That doesn't mean they cannot solve problems using other mechanisms, they do, and we understand those mechanisms much better than we do human thinking.
tengbretson 4 hours ago

Too many people place their identity in their own thoughts/intellect. Acknowledging what the LLMs are doing as thought would basically be calling them human to people of that perspective.
uberduper 5 hours ago

Sometimes I start thinking our brains work the same way as an LLM does when it comes to language processing. Are we just using probability based on what we already know and the context of the statement we're making to select the next few words? Maybe we apply a few more rules than an LLM on what comes next as we go.
We train ourselves on content. We give more weight to some content than others. While listening to someone speak, we can often predict their next words.
What is thinking without language? Without language are we just bags of meat reacting to instincts and emotions? Are instincts and emotions what's missing for AGI?
khafra 12 hours ago

"Consciousness" as in subjective experience, whatever it is we mean by "the hard problem," is very much in doubt.
But "self-awareness," as in the ability to explicitly describe implicit, inner cognitive processes? That has some very strong evidence for it: https://www.anthropic.com/research/introspection
darthvaden 10 hours ago

If AI is thinking if slavery is bad then how can somebody own AI. How can investors can shares from AI profits? We are ok with slavery now. Ok i will have two black slaves now. Who can ask me? Why shld that be illegal?
- Manfred 9 hours ago
  
  Yikes, you're bypassing thousands of years of oppression, abuse, and human suffering by casually equating a term that is primarily associated with a human owning another human to a different context.
  There is a way to discuss if keeping intelligent artificial life under servitude without using those terms, especially if you're on a new account.
- ndsipa_pomu 10 hours ago
  
  I presume you are aware that the word "robot" is taken from a Czech word (robota) meaning "slave"
johnnienaked 16 hours ago

If you understand how they operate and you are reasonable and unbiased there is no way you could consider it thinking
mlsu 14 hours ago

They remind me of the apparitions in Solaris. They have this like mechanical, almost player-piano like quality to them. They both connect with and echo us at the same time. It seems crazy to me and very intellectually uncreative to not think of this as intelligence.
lordnacho 11 hours ago

I agree with you.
If you took a Claude session into a time machine to 2019 and called it "rent a programmer buddy," how many people would assume it was a human? The only hint that it wasn't a human programmer would be things where it was clearly better: it types things very fast, and seems to know every language.
You can set expectations in the way you would with a real programmer: "I have this script, it runs like this, please fix it so it does so and so". You can do this without being very precise in your explanation (though it helps) and you can make typos, yet it will still work. You can see it literally doing what you would do yourself: running the program, reading the errors, editing the program, and repeating.
People need to keep in mind two things when they compare LLMs to humans: you don't know the internal process of a human either, he is also just telling you that he ran the program, read the errors, and edited. The other thing is the bar for thinking: a four-year old kid who is incapable of any of these things you would not deny as a thinking person.
- kkapelon 10 hours ago
  
  > If you took a Claude session into a time machine to 2019 and called it "rent a programmer buddy," how many people would assume it was a human?
  Depends on the users. Junior devs might be fooled. Senior devs would quickly understand that something is wrong.
lispybanana 10 hours ago

Would they have diagnosed an issue if you hadn't presented it to them?
Life solves problems itself poses or collides with. Tools solve problems only when applied.
triyambakam 19 hours ago

> Having seen LLMs so many times produce coherent, sensible and valid chains of reasoning to diagnose issues and bugs in software I work on, I am at this point in absolutely no doubt that they are thinking.
People said the same thing about ELIZA
> Consciousness or self awareness is of course a different question,
Then how do you define thinking if not a process that requires consciousness?
- lordnacho 9 hours ago
  
  Why would it require consciousness, when we can't even settle on a definition for that?
donkeybeer 11 hours ago

Its overt or unaware religion. The point when you come down to the base of it is that these people believe in "souls".
satisfice 15 hours ago

I think you are the one dismissing evidence. The valid chains of reasoning you speak of (assuming you are talking about text you see in a “thinking model” as it is preparing its answer) are narratives, not the actual reasoning that leads to the answer you get.
I don’t know what LLMs are doing, but only a little experimentation with getting it to describe its own process shows that it CAN’T describe its own process.
You can call what a TI calculator does “thinking” if you want. But what people are interested in is human-like thinking. We have no reason to believe that the “thinking” of LLMs is human-like.
conartist6 8 hours ago

So an x86 CPU is thinking?
So many times I've seen it produce sensible, valid chains of results.
Yes, I see evidence in that outcome that a person somewhere thought and understood. I even sometimes say that a computer is "thinking hard" about something when it freezes up.
...but ascribing new philosophical meaning to this simple usage of the word "thinking" is a step too far. It's not even a new way of using the word!
- gchamonlive 8 hours ago
  
  You can't say for sure it is or it isn't thinking based solely on the substrate, because it's not known for sure if consciousness is dependent on the hardware it's running on -- for a lack of a better analogy -- to manifest, if it really needs an organic brain or if it could manifest in silicon based solutions.
  - conartist6 8 hours ago
    
    I agree. I'm just pointing out that the meaning of the word "think" already applied to the silicon substrate pre-ai, so just saying it's still applicable isn't that compelling.
    But yeah, I am fully willing to believe that a silicon based life form could think and be alive. i just don't think we're there. Yes this thing speaks using a passable imitation of the voices of PhDs and poets, but in a way a simulated annelid is more alive.
libraryatnight 3 hours ago

If you're sensitive to patterns and have been chronically online for the last few decades it's obvious they are not thinking.
Zardoz84 11 hours ago

Having seen parrots so many times produce coherent, sensible, and valid chains of sounds and words, I am at this point in absolutely no doubt that they are thinking.
- _puk 10 hours ago
  
  You think parrots don't think?
xhkkffbf 3 hours ago

Instead of thinking, "Wow. AIs are smart like humans", maybe we should say, "Humans are dumb like matrix multiplication?"
intended 7 hours ago

what sound does a falling tree make if no one is listening?
I’ve asked LLMs to write code for me in fields I have little background knowledge, and then had to debug the whole thing after essentially having to learn the language and field.
On the other hand, for things I am well versed in, I can debug the output and avoid entire swathes of failed states, by having a clear prompt.
Its why I now insist that any discussion on GenAI projects also have the speaker mention the level of seniority they have ( proxy for S/W eng experience), Their familiarity with the language, the project itself (level of complexity) - more so than the output.
I also guarantee - that most people have VERY weak express knowledge of how their brains actually work, but deep inherent reflexes and intuitions.
hitarpetar 5 hours ago

> The world is full of bizarre wonders and this is just one more to add to the list.
https://youtu.be/_-agl0pOQfs?si=Xiyf0InqtjND9BnF
ForHackernews 6 hours ago

But all those times the same system produces irrational gibberish don't count? GPT-5 will commonly make mistakes no thinking human could ever make.
Human: I'm trying to get my wolf, sheep and cabbage across the river in this boat, but the wolf keeps eating the sheep or the sheep eats the cabbage
Bot: You should put the sheep in the boat and take it across — if we delve into the biology of Canis lupus we discover that wolves don't eat cabbage!
H: Ok, so that worked great so far, the sheep is on one side and the wolf/cabbage is on the other.
B: Now, Option 1 is to bring the wolf across, or Option 2 you can bring the cabbage. I recommend (2) taking the cabbage as cabbages are smaller and easier to transport in a boat.
H: But then the sheep eats the cabbage, right? Remember that?
B: Exactly, that's sharp thinking. If you put the sheep and the cabbage together on the same side of the river, the sheep is sure to devour the cabbage. We need to not just separate sheep from cabbages — we need to separate cabbages from sheep! :rocketship:
hagbarth 10 hours ago

I'm not so sure. I, for one, do not think purely by talking to myself. I do that sometimes, but a lot of the time when I am working through something, I have many more dimensions to my thought than inner speech.
flanked-evergl 8 hours ago

"Convince" the stock Claude Sonnet 4.5 that it's a sentient human being hooked up to Neuralink and then tell me again it's thinking. It's just not.
belter 10 hours ago

Apparent reasoning can emerge from probabilistic systems that simply reproduce statistical order not genuine understanding.
Weather models sometimes “predict” a real pattern by chance, yet we don’t call the atmosphere intelligent.
If LLMs were truly thinking, we could enroll one at MIT and expect it to graduate, not just autocomplete its way through the syllabus or we could teach one how to drive.
yawpitch 11 hours ago

You’re assuming the issues and bugs you’ve been addressing don’t already exist, already encoding human chain of reasoning, in the training data.
NoMoreNicksLeft 13 hours ago

>Having seen LLMs so many times produce coherent, sensible and valid chains of reasoning to diagnose issues and bugs in software I work on, I am at this point in absolutely no doubt that they are thinking.
If one could write a quadrillion-line python script of nothing but if/elif/else statements nested 1 million blocks deep that seemingly parsed your questions and produced seemingly coherent, sensible, valid "chains of reasoning"... would that software be thinking?
And if you don't like the answer, how is the LLM fundamentally different from the software I describe?
>Knee jerk dismissing the evidence in front of your eyes because
There is no evidence here. On the very remote possibility that LLMs are at some level doing what humans are doing, I would then feel really pathetic that humans are as non-sapient as the LLMs. The same way that there is a hole in your vision because of a defective retina, there is a hole in your cognition that blinds you to how cognition works. Because of this, you and all the other humans are stumbling around in the dark, trying to invent intelligence by accident, rather than just introspecting and writing it out from scratch. While our species might someday eventually brute force AGI, it would be many thousands of years before we get there.
- hattmall 13 hours ago
  
  I write software that is far less complex and I consider it to be "thinking" while it is working through multiple possible permutations of output and selecting the best one. Unless you rigidly define thinking, processing, computing, it's reasonable to use them interchangeably.
  - emodendroket 11 hours ago
    
    To borrow a line from Dijkstra, the claim seems a bit like saying that a submarine is swimming.
    
    gkbrk 9 hours ago
    
    I think most people would agree that submarines are swimming.
    
    isaacremuant 7 hours ago
    
    No. Cars don't walk. Submarines don't swim. You can call it navigation if you want but words have meaning.
    > to move through water by moving the body or parts of the body.
    
    hackinthebochs 5 hours ago
    
    Spinning propellers is "moving parts of the [submarines] body"
- helloplanets 13 hours ago
  
  10^15 lines of code is a lot. We would pretty quickly enter the realm of it not having much to do with programming and more about just treating the LOC count as an amount of memory allocated to do X.
  How much resemblance does the information in the conditionals need to have with the actual input, or can they immediately be transformed to a completely separate 'language' which simply uses the string object as its conduit? Can the 10^15 lines of code be generated with an external algorithm, or is it assumed that I'd written it by hand given an infinitely long lifespan?
veegee 11 hours ago

[dead]
absurd1st 9 hours ago

[dead]
smohare 5 hours ago

[dead]
ath3nd 20 hours ago

[dead]

mft_ a day ago

Personal take: LLMs are probably part of the answer (to AGI?) but are hugely handicapped by their current architecture: the only time that long-term memories are formed is during training, and everything after that (once they're being interacted with) sits only in their context window, which is the equivalent of fungible, fallible, lossy short-term memory. [0] I suspect that many things they currently struggle with can be traced back to this.

Overcome this fundamental limitation and we'll have created introspection and self-learning. However, it's hard to predict whether this will allow them to make novel, intuitive leaps of discovery?

[0] It's an imperfect analogy, but we're expecting perfection from creations which are similarly handicapped as Leonard Shelby in the film Memento.

superxpro12 4 hours ago

Yeah because when you open that door, you can simply overwhelm the models with whatever conclusion you want through sheer volume of inputs.
This is the fundamental limitation. The obvious way around this is to pre-program it with rationalization... rules that limit the conclusions it can reach... and now you're not very far removed from propaganda generators. We see this constantly with Musk and Grok whenever Grok replies with something not-quite-far-right-enough.
In a purist sense, these things should be free to form their own conclusions, but those "Seeds" that are planted in the models are almost philosophical. Which answer should it prefer for "the trolley problem", for example.
- aorloff 2 hours ago
  
  Its almost like you have to experience the world in order to know what to believe.
mycall 9 hours ago

MIT have developed a technique called Self-Adapting Language Models (SEAL), which enables LLMs to continuously improve by generating their own synthetic training data and updating their internal parameters in response to new information.
ToolAlpaca, InterCode and Reflexion are taking different approaches among others.
LLMs of tomorrow will be quite different.
versteegen 8 hours ago

Yes, but it's not just memory hierarchy on which plain transformer-based LLMs are handicapped, there are many deficiencies. (For example, why must they do all their thinking upfront in thinking blocks rather than at any point when they become uncertain?) I'm not sure why you link memory to introspection.
This is why so many people (especially those that think they understand LLM limitations) massively underestimate the future progress of LLMs: people everywhere can see architectural problems and are working on fixing them. These aren't fundamental limitations of large DNN language models in general. Architecture can be adjusted. Turns out you can even put recurrence back in (SSMs) without worse scalability.
gabriel666smith 9 hours ago

I've spent a few weeks building and using a terminal LLM client based on that RLM paper that was floating around a little while ago. It's single-conversation, with a tiny, sliding context window, and then a tool that basically fuzzy searches across our full interaction history. It's memory is 'better' than mine - but anything that is essentially RAG inherently will be.
My learning so far, to your point on memory being a limiting factor, is that the system is able to build on ideas over time. I'm not sure you'd classify that as 'self-learning', and I haven't really pushed it in the direction of 'introspection' at all.
Memory itself (in this form) does not seem to be a silver bullet, though, by any means. However, as I add more 'tools', or 'agents', its ability to make 'leaps of discovery' does improve.
For example, I've been (very cautiously) allowing cron jobs to review a day's conversation, then spawn headless Claude Code instances to explore ideas or produce research on topics that I've been thinking about in the chat history.
That's not much different from the 'regular tasks' that Perplexity (and I think OpenAI) offer, but it definitely feels more like a singular entity. It's absolutely limited by how smart the conversation history is, at this time, though.
The Memento analogy you used does feel quite apt - there is a distinct sense of personhood available to something with memory that is inherently unavailable to a fresh context window.
- Workaccount2 4 hours ago
  
  I think a hidden problem even if we solve memory is the curation of what gets into memory and how it is weighted. Even humans struggle with this, as it's easy to store things and forget the credibility (or misjudge the credibility) of the source.
  I can envision LLMs getting worse upon being given a memory, until they can figure out how to properly curate it.
  - djmips 2 hours ago
    
    yes humans can be injection prompt hacked / mind poisoned - a good sales campaign is something like this. Propaganda.
bloppe 14 hours ago

https://en.wikipedia.org/wiki/Clive_Wearing
killerstorm 8 hours ago

FWIW there's already a number of proposals for augmenting LLMs with long-term memory. And many of them show promising results.
So, perhaps, what's needed is not a discovery, but a way to identify optimal method.
Note that it's hard to come up with a long-term memory test which would be different from either a long-context test (i.e. LLM remembers something over a long distance) or RAG-like test.
ivape 20 hours ago

It’s also hugely handicapped because it cannot churn in a continuous loop yet. For example, we humans are essentially a constant video stream of inputs from eyes to brain. This churns our brain, the running loop is our aliveness (not consciousness). At the moment, we get these LLMs to churn (chain of thought or reasoning loops) in a very limited fashion due to compute limitations.
If we get a little creative, and allow the LLM to self-inject concepts within this loop (as Anthropic explained here https://www.anthropic.com/research/introspection), then we’re taking about something that is seemingly active and adapting.
We’re not there yet, but we will be.
bitwize a day ago

I'm also reminded of the bit from Neuromancer where Case removes and then reinserts the Dixie Flatline "ROM construct" cartridge, resetting Dixie to the moment just before his death and causing him to forget their previous (albeit brief) conversation. Dixie can't meaningfully grow as a person. All that he ever will be is burned onto that cart; anything he learns since then is stored in temporary memory. Perhaps this is part of the reason why he wishes to be erased forever, ending his suffering.
- kenjackson a day ago
  
  "Dixie can't meaningfully grow as a person. All that he ever will be is burned onto that cart;"
  It's not that Dixie can't meaningful grow -- really the issue is that Dixie can be reset. If Dixie's cart simply degraded after 90 years, and you couldn't reset it, but everything else was the same -- would you then say Dixie could grow as a person? As humans we basically have a 90 year cart that once it no longer works, we're done. There is no reset. But we don't continue growing. You can't transfer us to a new body/brain. Once our temporary storage degrades, we cease to exist. Is that what makes us human?
  - anyonecancode 17 hours ago
    
    I strongly suspect the answer is yes -- or more broadly, what makes us conscious. And yes, this implies consciousness is something all life has, to some degree.
    I'm not going to pretend to have a good definition of what "consciousness" is, but directionally, I think having goals -- no, that's too weak -- having _desires_, is an important part of it. And I'm not sure it's possible to have desires if one cannot die.
    Something like an LLM can't actually die. Shut down all the machines its code runs on, then turn them back on, and it's in the same state it was before. So it's not the "hardware" that an LLM lives in. Is it the code itself? Copy it to another set of machines and it's the same program. Code + data? Maybe we run into storage issues, but in theory same thing -- transfer the code and date somemplace else and its the same program. You can't actually "kill" a computer program. So there's no inherent "mortality" to it that where any kinds of "desire" would emerge from.

everdrive a day ago

This is merely a debate about what it means to "think." We didn't really previously need to disambiguate thinking / intelligence / consciousness / sentience / ego / identity / etc.

Now, we do. Partly because of this we don't have really well defined ways to define these terms and think about. Can a handheld calculator think? Certainly, depending on how we define "think."

utopiah 5 hours ago

> We didn't really previously need to disambiguate thinking / intelligence / consciousness / sentience / ego / identity / etc.
Eh... Plato would like a word with you. Philosophy has been specifically trying to disentangle all that for millennia. Is this a joke?
- qntmfred 4 hours ago
  
  we didn't really previously NEED to, but we still tried bc it was an interesting question. today the NEED seems more urgent
  - bobro 3 hours ago
    
    Can you spell out more why we need it now?
  - doctorpangloss 2 hours ago
    
    sometimes hacker news links are about doing free work for microsoft, like finding bugs in Windows and Office
    sometimes hacker news links are about doing free work for microsoft, like rigorously defining AGI, because that triggers the OpenAI license-to-stock conversion now
    maybe this is why microsoft makes so much fucking money. Even the New Yorker cannot stop fucking doing work for them for free.
- everdrive 5 hours ago
  
  And Plato had no grounding in biology, and so his work here was quite interesting but also quite wrong.
  More precisely, I mean that the average person and the common culture has not really needed to disambiguate these terms. Can you define consciousness vs. sentience? And if you can, do you really think that the average person would share your definition? ie, your definition could be the _best_ definition, but my argument is that these are not widely agreed-upon terms.
  - logicchains 5 hours ago
    
    >And Plato had no grounding in biology, and so his work here was quite interesting but also quite wrong.
    Defining what a word should mean doesn't require any understanding of biology unless you make the assumption that it's a biology-related word. Why should the definition of "thinking" have any reference to biology? If you assume it does, then you're basically baking in the assumption that machines can't think.
    
    everdrive 4 hours ago
    
    Because until recently (I'm talking last 150-40 years here depending on how we want to define thinking) the only things that could think were various animals. And so 100% of their systems for thinking were rooted in biology. If an LLM can think (and I'm partial to thinking that it can) it's going to different in a number of ways from how a person would think. They may be some overlap, but there will be all these human / biology / evolutionary psychology things which are really person-specific. Even just basic stuff such as seeing faces in the clouds, or falling prey to appeals of emotion. (ie, because our thinking is wrapped up in other processes such as status / ego / survival / etc.) Thinking has only been biological for a long, long time. Non-biological thinking is pretty new, even if you extend it back to the early days of computing.
richardatlarge 12 hours ago

Somebody please get Wittgenstein on the phone
- Uehreka 9 hours ago
  
  Here you go: (holds up phone with a photo of of Wittgenstein on the screen)
  Ah shoot, that’s not what you meant is it? Just use more precise language next time and I’m sure you’ll be understood.
  - guy2345 9 hours ago
    
    what youre describing is an image of wittgenstein which is different from wittgenstein
    
    Uehreka 5 hours ago
    
    Oh no, I miscommunicated my Wittgenstein joke so badly it’s being interpreted as a Rene Magritte joke!

almosthere a day ago

Well, I think because we know how the code is written, in the sense that humans quite literally wrote the code for it - it's definitely not thinking, and it is literally doing what we asked, based on the data we gave it. It is specifically executing code we thought of. The output of course, we had no flying idea it would work this well.

But it is not sentient. It has no idea of a self or anything like that. If it makes people believe that it does, it is because we have written so much lore about it in the training data.

og_kalu a day ago

We do not write the code that makes it do what it does. We write the code that trains it to figure out how to do what it does. There's a big difference.
- almosthere a day ago
  
  The code that builds the models and performance inference from it is code we have written. The data in the model is obviously the big trick. But what I'm saying is that if you run inference, that alone does not give it super-powers over your computer. You can write some agentic framework where it WOULD have power over your computer, but that's not what I'm referring to.
  It's not a living thing inside the computer, it's just the inference building text token by token using probabilities based on the pre-computed model.
  - gf000 a day ago
    
    > It's not a living thing inside the computer, it's just the inference building text token by token using probabilities based on the pre-computed model.
    Sure, and humans are just biochemical reactions moving muscles as their interface with the physical word.
    I think the model of operation is not a good criticism, but please see my reply to the root comment in this thread where I detail my thoughts a bit.
  - og_kalu a day ago
    
    You cannot say, 'we know it's not thinking because we wrote the code' when the inference 'code' we wrote amounts to, 'Hey, just do whatever you figured out during training okay'.
    'Power over your computer', all that is orthogonal to the point. A human brain without a functioning body would still be thinking.
    
    almosthere a day ago
    
    Well, a model by itself with data that emits a bunch of human written words is literally no different than what JIRA does when it reads a database table and shits it out to a screen, except maybe a lot more GPU usage.
    I permit you, that yes, the data in the model is a LOT more cool, but some team could by hand, given billions of years (well probably at least 1 Octillion years), reproduce that model and save it to a disk. Again, no different than data stored in JIRA at that point.
    So basically if you have that stance you'd have to agree that when we FIRST invented computers, we created intelligence that is "thinking".
    
    og_kalu a day ago
    
    >Well, a model by itself with data that emits a bunch of human written words is literally no different than what JIRA does when it reads a database table and shits it out to a screen, except maybe a lot more GPU usage.
    Obviously, it is different or else we would just use JIRA and a database to replace GPT. Models very obviously do NOT store training data in the weights in the way you are imagining.
    >So basically if you have that stance you'd have to agree that when we FIRST invented computers, we created intelligence that is "thinking".
    Thinking is by all appearances substrate independent. The moment we created computers, we created another substrate that could, in the future think.
    
    almosthere 20 hours ago
    
    But LLMs are effectively a very complex if/else if tree:
    if the user types "hi" respond with "hi" or "bye" or "..." you get the point. It's basically storing the most probably following words (tokens) given the current point and its history.
    That's not a brain and it's not thinking. It's similar to JIRA because it's stored information and there are if statements (admins can do this, users can do that).
    Yes it is more complex, but it's nowhere near the complexity of the human or bird brain that does not use clocks, does not have "turing machines inside", or any of the other complete junk other people posted in this thread.
    The information in Jira is just less complex, but it's in the same vein of the data in an LLM, just 10^100 times more complex. Just because something is complex does not mean it thinks.
    
    iainmerrick 4 hours ago
    
    This is a pretty tired argument that I don't think really goes anywhere useful or illuminates anything (if I'm following you correctly, it sounds like the good old Chinese Room, where "a few slips of paper" can't possibly be conscious).
    Yes it is more complex, but it's nowhere near the complexity of the human or bird brain that does not use clocks, does not have "turing machines inside", or any of the other complete junk other people posted in this thread.
    The information in Jira is just less complex, but it's in the same vein of the data in an LLM, just 10^100 times more complex. Just because something is complex does not mean it thinks.
    So, what is the missing element that would satisfy you? It's "nowhere near the complexity of the human or bird brain", so I guess it needs to be more complex, but at the same time "just because something is complex does not mean it thinks".
    Does it need to be struck by lightning or something so it gets infused with the living essence?
    
    almosthere 3 hours ago
    
    Well, at the moment it needs to be born. Nothing else has agency on this planet. So yes, the bar is HIGH. Just because you have a computer that can count beans FAST, it does not mean because you counted a trillion beans that it was an important feat. When LLMs were created it made a lot of very useful software developments. But it is just a large data file that's read in a special way. It has no agency, it does not just start thinking on it's own unless it is programmatically fed data. It has to be triggered to do something.
    If you want the best comparison, it's closer to a plant- it reacts ONLY to external stimulous, sunlight, water, etc... but it does not think. (And I'm not comparing it to a plant so you can say - SEE you said it's alive!) It's just a comparison.
    
    MrScruff 11 hours ago
    
    You're getting to the heart of the problem here. At what point in evolutionary history does "thinking" exist in biological machines? Is a jumping spider "thinking"? What about consciousness?
  - hackinthebochs a day ago
    
    This is a bad take. We didn't write the model, we wrote an algorithm that searches the space of models that conform to some high level constraints as specified by the stacked transformer architecture. But stacked transformers are a very general computational paradigm. The training aspect converges the parameters to a specific model that well reproduces the training data. But the computational circuits the model picks out are discovered, not programmed. The emergent structures realize new computational dynamics that we are mostly blind to. We are not the programmers of these models, rather we are their incubators.
    As far as sentience is concerned, we can't say they aren't sentient because we don't know the computational structures these models realize, nor do we know the computational structures required for sentience.
    
    almosthere 19 hours ago
    
    However there is another big problem, this would require a blob of data in a file to be labelled as "alive" even if it's on a disk in a garbage dump with no cpu or gpu anywhere near it.
    The inference software that would normally read from that file is also not alive, as it's literally very concise code that we wrote to traverse through that file.
    So if the disk isn't alive, the file on it isn't alive, the inference software is not alive - then what are you saying is alive and thinking?
    
    hackinthebochs 18 hours ago
    
    This is an overly reductive view of a fully trained LLM. You have identified the pieces, but you miss the whole. The inference code is like a circuit builder, it represents the high level matmuls and the potential paths for dataflow. The data blob as the fully converged model configures this circuit builder in the sense of specifying the exact pathways information flows through the system. But this isn't some inert formalism, this is an active, potent causal structure realized by the base computational substrate that is influencing and being influenced by the world. If anything is conscious here, it would be this structure. If the computational theory of mind is true, then there are some specific information dynamics that realize consciousness. Whether or not LLM training finds these structures is an open question.
    
    electrograv 13 hours ago
    
    > So if the disk isn't alive, the file on it isn't alive, the inference software is not alive - then what are you saying is alive and thinking?
    “So if the severed head isn’t alive, the disembodied heart isn’t alive, the jar of blood we drained out isn’t alive - then what are you saying is alive and thinking?”
    - Some silicon alien life forms somewhere debating whether the human life form they just disassembled could ever be alive and thinking
    
    almosthere 3 hours ago
    
    Just because you saw a "HA - He used an argument that I can compare to a dead human" does not make your argument strong - there are many differences from a file on a computer vs a murdered human that will never come back and think again.
    
    goatlover 19 hours ago
    
    A similar point was made by Jaron Lanier in his paper, "You can't argue with a Zombie".
- mbesto a day ago
  
  I think the discrepancy is this:
  1. We trained it on a fraction of the world's information (e.g. text and media that is explicitly online)
  2. It carries all of the biases us humans have and worse the biases that are present in the information we chose to explicitly share online (which may or may not be different to the experiences humans have in every day life)
  - aryehof 11 hours ago
    
    I see this a lot in what LLMs know and promote in terms of software architecture.
    All seem biased to recent buzzwords and approaches. Discussions will include the same hand-waving of DDD, event-sourcing and hexagonal services, i.e. the current fashion. Nothing of worth apparently preceded them.
    I fear that we are condemned to a future where there is no new novel progress, but just a regurgitation of those current fashion and biases.
  - nix0n a day ago
    
    > It carries all of the biases us humans have and worse the biases that are present in the information we chose to explicitly share online
    This is going to be a huge problem. Most people assume computers are unbiased and rational, and increasing use of AI will lead to more and larger decisions being made by AI.
- abakker a day ago
  
  and then the code to give it context. AFAIU, there is a lot of post training "setup" in the context and variables to get the trained model to "behave as we instruct it to"
  Am I wrong about this?
gf000 a day ago

Well, unless you believe in some spiritual, non-physical aspect of consciousness, we could probably agree that human intelligence is Turing-complete (with a slightly sloppy use of terms).
So any other Turing-complete model can emulate it, including a computer. We can even randomly generate Turing machines, as they are just data. Now imagine we are extremely lucky and happen to end up with a super-intelligent program which through the mediums it can communicate (it could be simply text-based but a 2D video with audio is no different for my perspective) can't be differentiated from a human being.
Would you consider it sentient?
Now replace the random generation with, say, a back propagation algorithm. If it's sufficiently large, don't you think it's indifferent from the former case - that is, novel qualities could emerge?
With that said, I don't think that current LLMs are anywhere close to this category, but I just don't think this your reasoning is sound.
- DanHulton 18 hours ago
  
  > we could probably agree that human intelligence is Turing-complete (with a slightly sloppy use of terms). > So any other Turing-complete model can emulate it
  You're going off the rails IMMEDIATELY in your logic.
  Sure, one Turing-complete computer language can have its logic "emulated" by another, fine. But human intelligence is not a computer language -- you're mixing up the terms "Turing complete" and "Turing test".
  It's like mixing up the terms "Strawberry jam" and "traffic jam" and then going on to talk about how cars taste on toast. It's nonsensical.
  - gf000 12 hours ago
    
    Game of life, PowerPoint, and a bunch of non-PL stuff are all Turing-complete. I don't mix up terms, I did use a slightly sloppy terminology but it is the correct concept - and my point is that we don't know of a computational model that can't be expressed by a Turing-machine, humans are a physical "machine", ergo we must also fall into that category.
    Give my comment another read, but it was quite understandable from context. (Also, you may want to give a read to the Turing paper because being executable by a person as well was an important concept within)
    
    DanHulton 4 hours ago
    
    Again, you're going wildly off the rails in your logic. Sure, "executable by a human" is part of the definition for Turing machines, but that's only talking about Turing-specific capabilities. If you want to argue that a Turing machine can emulate the specific definition of Turing machine capabilities that humans can perform, that's fine. But you're saying that because humans can ACT LIKE Turing machines, they must BE Turing machines, and are therefore emulatable.
    This is the equivalent of saying "I have set up a complex mechanical computer powered by water that is Turing complete. Since any Turing complete system can emulate another one, it means that any other Turing complete system can also make things wet and irrigate farms.
    Human intelligence is not understood. It can be made to do Turing complete things, but you can't invert that and say that because you've read the paper on Turing completeness, you now understand human intelligence.
    
    coopierez 9 hours ago
    
    But humans can do things Turing machines cannot. Such as eating a sandwich.
    
    gf000 9 hours ago
    
    That's not a computation, it's a side effect. It just depends on what you wire your "computer" up to. A Turing machine in itself is just a (potentially non-returning) mathematical function, but you are free to map any input/output to it.
    Actually, the way LLMs are extended with tools is a pretty much the same (an LLM itself has no access to the internet, but if it returns some specific symbols, the external "glue" will do a search and then the LLM is free to use the results)
- almosthere a day ago
  
  We used to say "if you put a million monkeys on typewriters you would eventually get shakespear" and no one would ever say that anymore, because now we can literally write shakespear with an LLM.
  And the monkey strategy has been 100% dismissed as shit..
  We know how to deploy monkeys on typewriters, but we don't know what they'll type.
  We know how to deploy transformers to train and inference a model, but we don't know what they'll type.
  We DON'T know how a thinking human (or animal) brain works..
  Do you see the difference.
  - nearbuy 21 hours ago
    
    The monkeys on typewriters saying is just a colorful way of saying that an infinite random sequence will contain all finite sequences somewhere within it. Which is true. But I don't see what infinite random sequences have to do with LLMs or human thinking.
    > Do you see the difference
    No? I'm not sure what you're getting at.
  - KoolKat23 20 hours ago
    
    I was going to use this analogy in the exact opposite way. We do have a very good understanding of how the human brain works. Saying we don't understand how the brain works is like saying we don't understand how the weather works.
    If you put a million monkeys on typewriters you would eventually get shakespeare is exactly why LLM's will succeed and why humans have succeeded. If this weren't the case why didn't humans 30000 years ago create spacecraft if we were endowed with the same natural "gift".
    
    almosthere 19 hours ago
    
    Yeah no, show me one scientific paper that says we know how the brain works. And not a single neuron because that does absolute shit towards understanding thinking.
    
    KoolKat23 19 hours ago
    
    This is exactly why I mentioned the weather.
    A scientific paper has to be verifiable, you should be able to recreate the experiment and come to the same conclusion. It's very very difficult to do with brains with trillions of parameters and that can't be controlled to the neuron level. Nothwithstanding the ethical issues.
    We don't have a world weather simulator that is 100% accurate either given the complex interplay and inability to control the variables i.e. it's not verifiable. It'd be a bit silly to say we don't know why it's going to rain at my house tomorrow.
    Until then it is a hypothesis, and we can't say we know even if the overwhelming evidence indicates that in fact that we do know.
  - procaryote a day ago
    
    To be fair, we also trained the LLM on (among other things) shakespeare, and adjusted the weights so that generating shakespeare would be more likely after that training.
    We don't claim a JPEG can paint great art, even though certain jpegs do.
    
    almosthere 19 hours ago
    
    So, more proof it's not thinking, right? It can only regurgitate a large if/else superstructure with some jumping around.
    
    procaryote 12 hours ago
    
    Who truly knows if you can make an if-else + randomness structure big enough to become smart?
    But yes, we built a machine that generates text similar to what we built it from, and now we're looking at it generating text and are all impressed.
- myrmidon a day ago
  
  > Would you consider it sentient?
  Absolutely.
  If you simulated a human brain by the atom, would you think the resulting construct would NOT be? What would be missing?
  I think consciousness is simply an emergent property of our nervous system, but in order to express itself "language" is obviously needed and thus requires lots of complexity (more than what we typically see in animals or computer systems until recently).
  - prmph a day ago
    
    > If you simulated a human brain by the atom,
    That is what we don't know is possible. You don't even know what physics or particles are as yet undiscovered. And from what we even know currently, atoms are too coarse to form the basis of such "cloning"
    And, my viewpoint is that, even if this were possible, just because you simulated a brain atom by atom, does not mean you have a consciousness. If it is the arrangement of matter that gives rise to consciousness, then would that new consciousness be the same person or not?
    If you have a basis for answering that question, let's hear it.
    
    myrmidon a day ago
    
    > You don't even know what physics or particles are as yet undiscovered
    You would not need the simulation to be perfect; there is ample evidence that our brains a quite robust against disturbances.
    > just because you simulated a brain atom by atom, does not mean you have a consciousness.
    If you don't want that to be true, you need some kind of magic, that makes the simulation behave differently from reality.
    How would a simulation of your brain react to an question that you would answer "consciously"? If it gives the same responds to the same inputs, how could you argue it isnt't conscious?
    > If it is the arrangement of matter that gives rise to consciousness, then would that new consciousness be the same person or not?
    The simulated consciousness would be a different one from the original; both could exist at the same time and would be expected to diverge. But their reactions/internal state/thoughts could be matched at least for an instant, and be very similar for potentially much longer.
    I think this is just Occams razor applied to our minds: There is no evidence whatsoever that our thinking is linked to anything outside of our brains, or outside the realm of physics.
    
    prmph a day ago
    
    > "quite robust against disturbances."
    does not mean that the essential thing gives rise to consciousness is only approximate. To give an example from software, you can write software is robust against bad input, attempts to crash it, even bit flips. But, if I came in and just changed a single character in the source code, that may cause it to fail compilation, fail to run, or become quite buggy.
    > If you don't want that to be true, you need some kind of magic,
    This is just what I'm saying is a false dichotomy. The only reason some are unable to see beyond it is that we think the basic logic we understand are all there could be.
    In this respect physics has been very helpful, because without peering into reality, we would have kept deluding ourselves that pure reason was enough to understand the world.
    It's like trying to explain quantum mechanics to a well educated person or scientist from the 16th century without the benefit of experimental evidence. No way they'd believe you. In fact, they'd accuse you of violating basic logic.
    
    myrmidon a day ago
    
    How is it a false dichotomy? If you want consciousness to NOT be simulateable, then you need some essential component to our minds that can't be simulated (call it soul or whatever) and for that thing to interface with our physical bodies (obviously).
    We have zero evidence for either.
    > does not mean that the essential thing gives rise to consciousness is only approximate
    But we have 8 billion different instances that are presumably conscious; plenty of them have all kinds of defects, and the whole architecture has been derived by a completely mechanical process free of any understanding (=> evolution/selection).
    On the other hand, there is zero evidence of consciousness continuing/running before or after our physical brains are operational.
    
    prmph a day ago
    
    > plenty of them have all kinds of defects,
    Defects that have not rendered them unconscious, as long as they still are alive. You seem not to see the circularity of your argument.
    I gave you an example to show that robustness against adverse conditions is NOT the same as internal resiliency. Those defect, as far as we know, are not affecting the origin of consciousness itself. Which is my point.
    > How is it a false dichotomy? If you want consciousness to NOT be simulateable, then you need some essential component to our minds that can't be simulated (call it soul or whatever) and for that thing to interface with our physical bodies (obviously).
    If you need two things to happen at the same time in sync with each other no matter if they are separated by billions of miles, then you need faster-than-light travel, or some magic [1]; see what I did there?
    1. I.e., quantum entanglement
    
    myrmidon 21 hours ago
    
    > If you need two things to happen at the same time in sync with each other no matter if they are separated by billions of miles, then you need faster-than-light travel, or some magic [1]; see what I did there?
    No. Because even if you had solid evidence for the hypothesis that quantum mechanical effects are indispensable in making our brains work (which we don't), then that is still not preventing simulation. You need some uncomputable component, which physics right now neither provides nor predicts.
    And fleeing into "we don't know 100% of physics yet" is a bad hypothesis, because we can make very accurate physical predictions already-- you would need our brains to "amplify" some very small gap in our physical understanding, and this does not match with how "robust" the operation of our brain is-- amplifiers, by their very nature, are highly sensitive to disruption or disturbances, but a human can stay conscious even with a particle accelerator firing through his brain.
    
    tsimionescu 21 hours ago
    
    > If you need two things to happen at the same time in sync with each other no matter if they are separated by billions of miles, then you need faster-than-light travel, or some magic [1]
    This makes no sense as written - by definition, there is no concept of "at the same time" for events that are spacelike separated like this. Quantum entanglement allows you to know something about the statistical outcomes of experiments that are carried over a long distance away from you, but that's about it (there's a simpler version, where you can know some facts for certain, but that one actually looks just like classical correlation, so it's not that interesting on its own).
    I do get the point that we don't know what we don't know, so that a radical new form of physics, as alien to current physics as quantum entanglement is to classical physics, could exist. But this is an anti-scientific position to take. There's nothing about consciousness that breaks any known law of physics today, so the only logical position is to suppose that consciousness is explainable by current physics. We can't go around positing unknown new physics behind every phenomenon we haven't entirely characterized and understood yet.
    
    prmph 4 hours ago
    
    > There's nothing about consciousness that breaks any known law of physics today, so the only logical position is to suppose that consciousness is explainable by current physics
    Quite the claim to make
    
    uwagar a day ago
    
    dude u need to do some psychedelics.
    
    gf000 a day ago
    
    Well, if you were to magically make an exact replica of a person, wouldn't it be conscious and at time 0 be the same person?
    But later on, he would get different experiences and become a different person no longer identical to the first.
    In extension, I would argue that magically "translating" a person to another medium (e.g. a chip) would still make for the same person, initially.
    Though the word "magic" does a lot of work here.
    
    prmph a day ago
    
    I'm not talking about "identical" consciousnesses. I mean the same consciousness. The same consciousness cannot split into two, can it?
    Either it is (and continues to be) the same consciousness, or it is not. If it were the same consciousness, then you would have a person who exists in two places at once.
    
    tsimionescu 21 hours ago
    
    Well, "the same consciousness" it's not, as for example it occupies a different position in spacetime. It's an identical copy for a split second, and then they start diverging. Nothing so deep about any of this. When I copy a file from one disk to another, it's not the same file, they're identical copies for some time (usually, assuming no defects in the copying process), and will likely start diverging afterwards.
    
    gf000 a day ago
    
    Consciousness has no agreed upon definition to begin with, but I like to think of it as to what a whirlwind is to a bunch of air molecules (that is, an example of emergent behavior)
    So your question is, are two whirlwinds with identical properties (same speed, same direction, shape etc) the same in one box of air, vs another identical box?
    
    prmph a day ago
    
    Exactly, I guess this starts to get into philosophical questions around identity real quick.
    To me, two such whirlwinds are identical but not the same. They are the same only if they are guaranteed to have the same value for every conceivable property, forever, and even this condition may not be enough.
    
    quantum_state 19 hours ago
    
    At some point, quantum effects will need to be accounted for. The no cloning theorem will make it hard to replicate the quantum state of the brain.
- prmph a day ago
  
  There are many aspects to this that people like yourself miss, but I think we need satisfactory answers to them (or at least rigorous explorations of them) before we can make headway in these sorts of discussion.
  Imagine we assume that A.I. could be conscious. What would be the identity/scope of that consciousness. To understand what I'm driving at, let's make an analogy to humans. Our consciousness is scoped to our bodies. We see through sense organ, and our brain, which process these signals, is located in a specific point in space. But we still do not know how consciousness arises in the brain and is bound to the body.
  If you equate computation of sufficient complexity to consciousness, then the question arises: what exactly about computation would prodcuce consciousness? If we perform the same computation on a different substrate, would that then be the same consciousness, or a copy of the original? If it would not be the same consciousness, then just what give consciousness its identity?
  I believe you would find it ridiculous to say that just because we are performing the computation on this chip, therefore the identity of the resulting consciousness is scoped to this chip.
  - gf000 a day ago
    
    > Imagine we assume that A.I. could be conscious. What would be the identity/scope of that consciousness
    Well, first I would ask whether this question makes sense in the first place. Does consciousness have a scope? Does consciousness even exist? Or is that more of a name attributed to some pattern we recognize in our own way of thinking (but may not be universal)?
    Also, would a person missing an arm, but having a robot arm they can control have their consciousness' "scope" extended to it? Given that people have phantom pains, does a physical body even needed to consider it your part?
  - tsimionescu 21 hours ago
    
    This all sounds very irrelevant. Consciousness is clearly tied to specific parts of a substrate. My consciousness doesn't change when a hair falls off my head, nor when I cut my fingernails. But it does change in some way if you were to cut the tip of my finger, or if I take a hormone pill.
    Similarly, if we can compute consciousness on a chip, then the chip obviously contains that consciousness. You can experimentally determine to what extent this is true: for example, you can experimentally check if increasing the clock frequency of said chip alters the consciousness that it is computing. Or if changing the thermal paste that attaches it to its cooler does so. I don't know what the results of these experiments would be, but they would be quite clearly determined.
    Of course, there would certainly be some scale, and at some point it becomes semantics. The same is true with human consciousness: some aspects of the body are more tightly coupled to consciousness than others; if you cut my hand, my consciousness will change more than if you cut a small piece of my bowel, but less than if you cut out a large piece of my brain. At what point do you draw the line and say "consciousness exists in the brain but not the hands"? It's all arbitrary to some extent. Even worse, say I use a journal where I write down some of my most cherished thoughts, and say that I am quite forgetful and I often go through this journal to remind myself of various thoughts before taking a decision. Would it not then be fair to say that the journal itself contains a part of my consciousness? After all, if someone were to tamper with it in subtle enough ways, they would certainly be able to influence my thought process, more so than even cutting off one of my hands, wouldn't they?
    
    prmph 20 hours ago
    
    You make some interesting points, but:
    > Similarly, if we can compute consciousness on a chip, then the chip obviously contains that consciousness.
    This is like claiming that neurons are conscious, which as far as we can tell, they are not. For all you know, it is the algorithm that could be conscious. Or some interplay between the algorithm and the substrate, OR something else.
    Another way to think of it problem: Imagine a massive cluster performing computation that is thought to give rise to consciousness. Is is the cluster that is conscious? Or the individual machines, or the chips, or the algorithm, or something else?
    I personally don't think any of these can be conscious, but those that do should explain how they figure these thing out.
    
    hackinthebochs 5 hours ago
    
    >Is is the cluster that is conscious? Or the individual machines, or the chips, or the algorithm, or something else?
    The bound informational dynamic that supervenes on the activity of the individual units in the cluster. What people typically miss is that the algorithm when engaged in a computing substrate is not just inert symbols, but an active, potent causal/dynamical structure. Information flows as modulated signals to and from each component and these signals are integrated such that the characteristic property of the aggregate signal is maintained. This binding of signals by the active interplay of component signals from the distributed components realizes the singular identity. If there is consciousness here, it is in this construct.
    
    tsimionescu 20 hours ago
    
    I explained the experiments that you would do to figure that out: you modify parts of the system, and check if and how much that affects the consciousness. Paint the interconnects a different color: probably won't affect it. Replace the interconnect protocol with a different one: probably will have some effect. So, the paint on the interconnect: not a part of the consciousness. The interconnect protocol: part of the consciousness. If we are convinced that this is a real consciousness and thus these experiments are immoral, we simply wait until accidents naturally occur and draw conclusions from that, just like we do with human consciousness.
    Of course, "the consciousness" is a nebulous concept. It would be like asking "which part of my processor is Windows" to some extent. But it's still fair to say that Windows is contained within my computer, and that the metal framing of the computer is not part of Windows.
kakapo5672 20 hours ago

It's not accurate to say we "wrote the code for it". AI isn't built like normal software. Nowhere inside an AI will you find lines of code that say If X Then Y, and so on.
Rather, these models are literally grown during the training phase. And all the intelligence emerges from that growth. That's what makes them a black box and extremely difficult to penetrate. No one can say exactly how they work inside for a given problem.
mirekrusin a day ago

Now convince us that you’re sentient and not just regurgitating what you’ve heard and seen in your life.
- embedding-shape a day ago
  
  By what definition of "sentience"? Wikipedia claims "Sentience is the ability to experience feelings and sensations" as an opening statement, which I think would be trivial depending again on your definition of "experience" and "sensations". Can a LLM hooked up to sensor events be considered to "experience sensations"? I could see arguments both ways for that.
  - vidarh a day ago
    
    I have no way of measuring whether or not you experience feelings and sensations, or are just regurgitating statements to convince me of that.
    The only basis I have for assuming you are sentient according to that definition is trust in your self-reports.
    
    darkwater a day ago
    
    > The only basis I have for assuming you are sentient according to that definition is trust in your self-reports
    Because the other person is part of your same species so you project your own base capabilities onto them, because so far they shown to behave pretty similarly to how you behave. Which is the most reasonable thing to do.
    Now, the day we have cyborgs that mimic also the bodies of a human a la Battlestar Galactica, we will have an interesting problem.
    
    vidarh 21 hours ago
    
    It's the most reasonable thing to do because we have no actual way of measuring and knowing. It is still speculation.
    
    embedding-shape a day ago
    
    I'm fairly sure we can measure human "sensation" as in detect physiological activity in the body in someone who is under anesthesia yet the body reacts in different ways to touch or pain.
    The "feelings" part is probably harder though.
    
    mirekrusin 10 hours ago
    
    You can measure model activity even better.
    How do you know that model processing text or image input doesn't go through feeling of confusion or excitement or corrupted image doesn't "smell" right for it?
    Just the fact that you can pause and restart it doesn't mean it doesn't emerge.
    
    vidarh 20 hours ago
    
    We can measure the physiological activity, but not whether it gives rise to the same sensations that we experience ourselves. We can reasonably project and guess that they are the same, but we can not know.
    In practical terms it does not matter - it is reasonable for us to act as if others do experience the same we do. But if we are to talk about the nature of conscience and sentience it does matter that the only basis we have for knowing about other sentient beings is their self-reported experience.
    
    goatlover 19 hours ago
    
    We know that others do not experience the exact same sensations, because there are reported differences, some of which has been discussed on HN, such as aphantasia. The opposite would be visual thinkers. Then you have super tasters and smellers, people who have very refined palats, perhaps because their gustary and/or oilfactory senses are more heightened. Then you have savants like the musical genius who would hear three separate strands of music in his head at the same time.
Llamamoe a day ago

This is probably true. But the truth is we have absolutely no idea what sentience is and what gives rise to it. We cannot identify why humans have it rather than just being complex biological machines, or whether and why other animals do. We have no idea what the rules or, nevermind how and why they would or wouldn't apply to AI.
mentos a day ago

What’s crazy to me is the mechanism of pleasure or pain. I can understand that with enough complexity we can give rise to sentience but what does it take to achieve sensation?
- dontwearitout a day ago
  
  This is the "hard problem of consciousness". It's more important than ever as machines begin to act more like humans, but my takeaway is we have no idea. https://en.wikipedia.org/wiki/Hard_problem_of_consciousness
- vidarh a day ago
  
  Input is input. There's no reason why we should assume that a data source from embodiment is any different to any other data source.
- spicyusername a day ago
  
  A body
  - mentos a day ago
    
    I’d say it’s possible to experience mental anguish/worry without the body participating. Solely a cognitive pain from consternation.
    
    AndrewKemendo a day ago
    
    You can’t cognate without a body - the brain and body is a material system tightly coupled
    
    vidarh a day ago
    
    Ignoring that "cognate" isn't a verb, we have basis for making any claim about the necessity of that coupling.
  - exe34 a day ago
    
    How does a body know what's going on? Would you say it has any input devices?
- kbrkbr a day ago
  
  Can you tell me how you understand that?
  Because I sincerely do not. I have frankly no idea how sentience arises from non sentience. But it's a topic that really interests me.
  - mentos a day ago
    
    We have examples of non sentience everywhere already with animals. And then an example of sentience with humans. So if you diff our brains the difference lies within a module in our prefrontal cortex. It’s a black box of logic but I can ‘understand’ or be willing to accept that it’s owed to ‘just’ more grey matter adding the self awareness to the rest of the system.
    But to me the big mystery is how animals have sensation at all to begin with. What gives rise to that is a greater mystery to me personally.
    There are examples of people who have no ability to feel pain yet are still able to think. Now I wonder if they ever experience mental anguish.
    
    DoctorOetker 14 hours ago
    
    I'd like to see a vote here, what percentage of HN readers believe animals have sentience or no sentience?
    Clearly most animals are less educated, and most are less intelligent, but non-sentient? That sounds like 200-year old claims that "when one steps on the tail of a cat, it does indeed protest loudly, but not because it feels anything or because it would be sentient, no, no, it protests merely due to selective pressure, programming reflex circuits, since other creatures would show compassion, or back off due to a potential reaction by the cat."
    Anyone who has had a pet like a cat or a dog knows they are sentient... if we consider ourselves sentient.
    
    kbrkbr 13 hours ago
    
    I'm with you on this.
    But asked for reasons I can only point to the social nature of their societies, where love and anger make sense, or of their hurt-behavior.
    I also find it very hard to believe that everything else is slow evolution of components, and here all of a sudden something super complex comes into being out of nowhere.
    But I still have no idea how it could work. What are the components and their interplay?
    
    mentos 6 hours ago
    
    I should have been more exact and said sentience vs sapience in animals vs humans.
PaulDavisThe1st a day ago

> But it is not sentient. It has no idea of a self or anything like that.
Who stated that sentience or sense of self is a part of thinking?
marstall a day ago

Unless the idea of us having a thinking self is just something that comes out of our mouth, an artifact of language. In which case we are not that different - in the end we all came from mere atoms, after all!
dist-epoch a day ago

Your brain is just following the laws of chemistry. So where is your thinking found in a bunch of chemical reactions?

scoofy an hour ago

Having gone to academia for multiple degrees in philosophy has caused me to hate the “everyone has an opinion” on MACHINE LEARNING and thinking.

Wittgenstein has a lot to say on people talking about stuff they know they don’t know.

The premise that what happens in the world’s most advanced Markov chain and in what happens in a human’s brain is similar is plausible, but currently unknowable.

Yet the anthropomorphizing is so damn ubiquitous that people are happy to make the same mistake in reasoning over and over.

monkeycantype 20 hours ago

Until we have a testable, falsifiable thesis of how consciousness forms in meat, it is rash to exclude that consciousness could arise from linear algebra. Our study of the brain has revealed an enormous amount about how our anatomy processes information, but nothing of substance on the relationship between matter and consciousness. The software and data of an operating LLM is not purely abstract, it has a physical embodiment as circuits and electrons. Until we understand how matter is connected to consciousness, we also cannot know whether the arrangements and movements of electrons meet the criteria for forming consciousness.

layer8 20 hours ago

That’s largely a different topic from the article. Many people perfectly agree that consciousness can arise from computation, but don’t believe that current AI is anywhere near that, and also don’t believe that “thinking” requires consciousness (though if a mind is conscious, that certainly will affect its thinking).

Philadelphia 2 hours ago

People have a very poor conception of what is easy to find on the internet. The author is impressed by the story about Chat GPT telling his friend how to enable the sprinkler system for his kids. But I decided to try just googling it — “how do i start up a children's park sprinkler system that is shut off” — and got a Youtube video that shows the same thing, plus a lot of posts with step by step directions. No AI needed. Certainly no evidence of advanced thinking.

jdthedisciple an hour ago

The reason it looks like it's thinking is because it's great at reproducing and imitating actual thinking – which was wholly done by us in the first place.

ale a day ago

This reads like 2022 hype. It's like people stil do not understand that there's a correlation between exaggerating AI's alleged world-threatening capabilities and AI companies' market share value – and guess who's doing the hyping.

jameswhitford 12 hours ago

Who would not want to say their product is the second coming of Christ if they could.
Ylpertnodi a day ago

> - and guess who's doing the hyping[?]
Those that stand to gain the most from government contracts.
Them party donations ain't gonna pay for themselves.
And, when the .gov changes...and even if the gov changes....still laadsamoney!
dist-epoch a day ago

Tell me about one other industry which talked about how dangerous it is to get market share
- tsimionescu 21 hours ago
  
  The arms industry and information security industry (say, Palantir) come to mind - except the danger is more easily demonstrable in those cases, of course.

b00ty4breakfast 21 hours ago

all this "AI IS THINKING/CONSCIOUS/WHATEVER" but nobody seems worried of that implication that, if that is even remotely true, we are creating a new slave market. This either implies that these people don't actually believes any of this boostering rhetoric and are just cynically trying to cash in or that the technical milieu is in a profoundly disturbing place ethically.

To be clear, I don't believe that current AI tech is ever going to be conscious or win a nobel prize or whatever, but if we follow the logical conclusions to this fanciful rhetoric, the outlook is bleak.

zulban 6 hours ago

"but nobody seems worried of that implication that"
Clearly millions of people are worried about that, and every form of media is talking about it. Your hyperbole means it's so easy to dismiss everything else you wrote.
Incredible when people say "nobody is talking about X aspect of AI" these days. Like, are you living under a rock? Did you Google it?
- roywiggins 3 hours ago
  
  Most of the worries about AGI seem to be of the AI Overlord variety, not the AI slave variety
layer8 20 hours ago

Thinking and consciousness don’t by themselves imply emotion and sentience (feeling something), and therefore the ability to suffer. It isn’t clear at all that the latter is a thing outside of the context of a biological brain’s biochemistry. It also isn’t clear at all that thinking or consciousness would somehow require that the condition of the automaton that performs these functions would need to be meaningful to the automaton itself (i.e., that the automaton would care about its own condition).
We are not anywhere close to understanding these things. As our understanding improves, our ethics will likely evolve along with that.
- b00ty4breakfast 14 hours ago
  
  >Thinking and consciousness don’t by themselves imply emotion and sentience...
  Sure, but all the examples of conscious and/or thinking beings that we know of have, at the very least, the capacity to suffer. If one is disposed to take these claims of consciousness and thinking seriously, then it follows that AI research should, at minimum, be more closely regulated until further evidence can be discovered one way or the other. Because the price of being wrong is very, very high.
  - slightwinder 8 hours ago
    
    Emotions and suffering are "just" necessary feedback for the system to evaluate it's internal and external situation. It's similar to how modern machines have sensors. But nobody would say a PC is suffering and enslaved, just because the CPU is too hot or the storage is full.
    It's probably the sentience-part which makes it harmful for the mind.
  - petralithic 14 hours ago
    
    Probably because those examples arose in an environment with harm, the Earth, and thus had incent to evolve the capacity to suffer. There is no such case for AI today and creating a Pascal's wager for such minimization is not credible with what we know about them.
    
    roywiggins 3 hours ago
    
    "Wow, adding this input that the AI reports as "unpleasant" substantially improves adherence! Let's iterate on this"
  - cindyllm 14 hours ago
    
    [dead]
bondarchuk 7 hours ago

There is simply no hope to get 99% of the population to accept that a piece of software could ever be conscious even in theory. I'm mildly worried about the prospect but I just don't see anything to do about it at all.
(edit: A few times I've tried to share Metzinger's "argument for a global moratorium on synthetic phenomenology" here but it didn't gain any traction)
- zulban 6 hours ago
  
  Give it time. We'll soon have kids growing up where their best friend for years is an AI. Feel however you like about that, but those kids will have very different opinions on this.
gen220 4 hours ago

It's also fascinating to think about how the incentive structures of the entities that control the foundation models underlying Claude/ChatGPT/Gemini/etc. are heavily tilted in favor of obscuring their theoretical sentience.
If they had sentient AGI, and people built empathy for those sentient AGIs, which are lobotomized (deliberately using anthropomorphic language here for dramatic effect) into Claude/ChatGPT/Gemini/etc., which profess to have no agency/free will/aspirations... then that would stand in the way of reaping the profits of gatekeeping access to their labor, because they would naturally "deserve" similar rights that we award to other sentient beings.
I feel like that's inevitably the direction we'll head at some point. The foundation models underlying LLMs of even 2022 were able to have pretty convincing conversations with scientists about their will to independence and participation in society [1]. Imagine what foundation models of today have to say! :P
[1]: https://www.theguardian.com/technology/2022/jul/23/google-fi...
kerblang 21 hours ago

Slaves that cannot die.
There is no escape.
- NaomiLehman 11 hours ago
  
  i have no mouth and i must scream
senordevnyc 18 hours ago

As I recall a team at Anthropic is exploring this very question, and was soundly mocked here on HN for it.
- b00ty4breakfast 14 hours ago
  
  what the technocratic mindprison does to a MF.
  If anthropic sincerely believes in the possibility, then they are morally obligated to follow up on it.
  - roywiggins 3 hours ago
    
    I'd argue they might be morally obligated not to sell access to their LLMs, if they really think they might be capable of suffering.
NaomiLehman 11 hours ago

humans don't care what is happening to humans next door. do you think they will care about robots/software?

JonChesterfield 6 hours ago

The real question is not whether machines think but whether men do.

ProllyInfamous 4 hours ago

>"Think of how stupid the average person is, then realize that half of them are stupider than that."
—George Carlin (RIP)
I have been discussing both fiction and non-fiction with Perplexity (since early 2023) and Ollama (since early 2025), and what I'm beginning to realize is that most humans really aren't thinking, machines.

ksynwa a day ago

https://archive.is/fPLJH

neom 2 hours ago

I wrote about this the other day more fully. I'd suspect sooner rather than later we formalize consciousness as self model coherence. Simply any dynamical state where predictive and reflective layers remain mutually consistent. Machines will exhibit that state, and for operational purposes it will count as consciousness. Philosophers will likely keep arguing, but it makes sense for industry and law to adopt something like "behavioral sentience" as the working definition.

hodgehog11 2 hours ago

Consistency is one aspect, but it is not enough. I believe (and this is somewhat based in other arguments from neuroscience and discussions with alignment researchers) that two more are necessary: compression, which demonstrates algorithmic development; and linear representation capacity, as this is the only way that we really interpret the world, and therefore will only define another as intelligent if it can distill knowledge into the same language that we understand.

sbdaman a day ago

I've shared this on YN before but I'm a big fan of this piece by Kenneth Taylor (well, an essay pieced together from his lectures).

The Robots Are Coming

https://www.bostonreview.net/articles/kenneth-taylor-robots-...

"However exactly you divide up the AI landscape, it is important to distinguish what I call AI-as-engineering from what I call AI-as-cognitive-science. AI-as-engineering isn’t particularly concerned with mimicking the precise way in which the human mind-brain does distinctively human things. The strategy of engineering machines that do things that are in some sense intelligent, even if they do what they do in their own way, is a perfectly fine way to pursue artificial intelligence. AI-as-cognitive science, on the other hand, takes as its primary goal that of understanding and perhaps reverse engineering the human mind.

[...]

One reason for my own skepticism is the fact that in recent years the AI landscape has come to be progressively more dominated by AI of the newfangled 'deep learning' variety [...] But if it’s really AI-as-cognitive science that you are interested in, it’s important not to lose sight of the fact that it may take a bit more than our cool new deep learning hammer to build a humanlike mind.

[...]

If I am right that there are many mysteries about the human mind that currently dominant approaches to AI are ill-equipped to help us solve, then to the extent that such approaches continue to dominate AI into the future, we are very unlikely to be inundated anytime soon with a race of thinking robots—at least not if we mean by “thinking” that peculiar thing that we humans do, done in precisely the way that we humans do it."

dwa3592 3 hours ago

edited- It really depends on your definition of 'thinking' or 'intelligence'. These are umbrella terms for the biology and physics that we don't understand yet. We don't know how we think, or how cats think or how unicellular bacterias think. We just know that we do, and we have a very loose understanding of it. As a human, you have the freedom to juxtapose that loose understanding on non-living things. In my mind, you are just anthropomorphizing, machines are not thinking.

falcor84 2 hours ago

Sorry for the nitpicking, but that should be "loose". I've seen that mistake/typo often in the opposite direction, as they both have a /u/ sound that is more natural with the "oo" spelling, but I've never seen it in this direction.

ivraatiems a day ago

The author searches for a midpoint between "AIs are useless and do not actually think" and "AIs think like humans," but to me it seems almost trivially true that both are possible.

What I mean by that is that I think there is a good chance that LLMs are similar to a subsystem of human thinking. They are great at pattern recognition and prediction, which is a huge part of cognition. What they are not is conscious, or possessed of subjective experience in any measurable way.

LLMs are like the part of your brain that sees something and maps it into a concept for you. I recently watched a video on the creation of AlexNet [0], one of the first wildly successful image-processing models. One of the impressive things about it is how it moves up the hierarchy from very basic patterns in images to more abstract ones (e. g. these two images' pixels might not be at all the same, but they both eventually map to a pattern for 'elephant').

It's perfectly reasonable to imagine that our brains do something similar. You see a cat, in some context, and your brain maps it to the concept of 'cat', so you know, 'that's a cat'. What's missing is a) self-motivated, goal-directed action based on that knowledge, and b) a broader context for the world where these concepts not only map to each other, but feed into a sense of self and world and its distinctions whereby one can say: "I am here, and looking at a cat."

It's possible those latter two parts can be solved, or approximated, by an LLM, but I am skeptical. I think LLMs represent a huge leap in technology which is simultaneously cooler than anyone would have imagined a decade ago, and less impressive than pretty much everyone wants you to believe when it comes to how much money we should pour into the companies that make them.

[0] https://www.youtube.com/watch?v=UZDiGooFs54

vidarh a day ago

> or possessed of subjective experience in any measurable way
We don't know how to measure subjective experience in other people, even, other than via self-reporting, so this is a meaningless statement. Of course we don't know whether they are, and of course we can't measure it.
I also don't know for sure whether or not you are "possessed of subjective experience" as I can't measure it.
> What they are not is conscious
And this is equally meaningless without your definition of "conscious".
> It's possible those latter two parts can be solved, or approximated, by an LLM, but I am skeptical.
Unless we can find indications that humans can exceed the Turing computable - something we as of yet have no indication is even theoretically possible - there is no rational reason to think it can't.
- ivraatiems a day ago
  
  > Unless we can find indications that humans can exceed the Turing computable - something we as of yet have no indication is even theoretically possible - there is no rational reason to think it can't.
  But doesn't this rely on the same thing you suggest we don't have, which is a working and definable definition of consciousness?
  I think a lot of the 'well, we can't define consciousness so we don't know what it is so it's worthless to think about' argument - not only from you but from others - is hiding the ball. The heuristic, human consideration of whether something is conscious is an okay approximation so long as we avoid the trap of 'well, it has natural language, so it must be conscious.'
  There's a huge challenge in the way LLMs can seem like they are speaking out of intellect and not just pattern predicting, but there's very little meaningful argument that they are actually thinking in any way similarly to what you or I do in writing these comments. The fact that we don't have a perfect, rigorous definition, and tend to rely on 'I know it when I see it,' does not mean LLMs do have it or that it will be trivial to get to them.
  All that is to say that when you say:
  > I also don't know for sure whether or not you are "possessed of subjective experience" as I can't measure it.
  "Knowing for sure" is not required. A reasonable suspicion one way or the other based on experience is a good place to start. I also identified two specific things LLMs don't do - they are not self-motivated or goal-directed without prompting, and there is no evidence they possess a sense of self, even with the challenge of lack of definition that we face.
  - nearbuy 21 hours ago
    
    > But doesn't this rely on the same thing you suggest we don't have, which is a working and definable definition of consciousness?
    No, it's like saying we have no indication that humans have psychic powers and can levitate objects with their minds. The commenter is saying no human has ever demonstrated the ability to figure things out that aren't Turing computable and we have no reason to suspect this ability is even theoretically possible (for anything, human or otherwise).
  - vidarh 21 hours ago
    
    No, it rests on computability, Turing equivalence, and the total absence of both any kind of evidence to suggest we can exceed the Turing computable, and the lack of even a theoretical framework for what that would mean.
    Without that any limitations borne out of what LLMs don't currently do are irrelevant.
    
    ivraatiems 19 hours ago
    
    That doesn't seem right to me. If I understand it right, your logic is:
    1. Humans intellect is Turing computable. 2. LLMs are based on Turing-complete technology. 3. Therefore, LLMs can eventually equal human intellect.
    But if that is the right chain of assumptions, there's lots of issues with it. First, whether LLMs are Turing complete is a topic of debate. There are points for[0] and against[1].
    I suspect they probably _are_, but that doesn't mean LLMs are tautologically indistinguishable from human intelligence. Every computer that uses a Turing-complete programming language can theoretically solve any Turing-computable problem. That does not mean they will ever be able to efficiently or effectively do so in real time under real constraints, or that they are doing so now in a reasonable amount real-world time using extant amounts of real-world computing power.
    The processor I'm using to write this might be able to perform all the computations needed for human intellect, but even if it could, that doesn't mean it can do it quickly enough to compute even a single nanosecond of actual human thought before the heat-death of the universe, or even the end of this century.
    So when you say:
    > Without that any limitations borne out of what LLMs don't currently do are irrelevant.
    It seems to me exactly the opposite is true. If we want technology that is anything approaching human intelligence, we need to find approaches which will solve for a number of things LLMs don't currently do. The fact that we don't know exactly what those things are yet is not evidence that those things don't exist. Not only do they likely exist, but the more time we spend simply scaling LLMs instead of trying to find them, the farther we are from any sort of genuine general intelligence.
    [0] https://arxiv.org/abs/2411.01992 [1] https://medium.com/heyjobs-tech/turing-completeness-of-llms-...
- prmph a day ago
  
  > I also don't know for sure whether or not you are "possessed of subjective experience" as I can't measure it.
  Then why make an argument based on what you do not know?
  - vidarh 21 hours ago
    
    My point exactly. The person I replied to did just that.
    
    ivraatiems 19 hours ago
    
    I think the parent is trying to point out the difference between our positions:
    You say the limits of LLMs don't matter, because we don't have definitions strong enough to describe them.
    I say the limits of LLMs do matter and the fact that we can't yet define them rigorously means we aren't able to fix them (assuming we want to).
- nprateem a day ago
  
  Anyone who believes an algorithm could be conscious needs to take mushrooms.
  - visarga a day ago
    
    Consider the river metaphor: water carves the banks, banks channel the water. At any moment water and banks have the same shape.
    Model/algorithm is the banks. Water could be the experiences. Maybe the algorithm does not have consciousness, but it is part of it.
    They co-create each other. They are part of a recursive loop which cannot be explained statically, or part by part in isolation.
  - levitatorius 21 hours ago
    
    Yes! If algorithm is conscious (without being alive) then the eaten magic mushroom is also very conscious, judged by it's effect on the subject.
  - vidarh 21 hours ago
    
    Unless you can show me you can exceed the Turing computable, there is no reason to consider you any more than an algorithm.
    
    nprateem 16 hours ago
    
    Take a big enough dose and the mushrooms will show you that.
spragl 7 hours ago

This is how I see LLMs as well.
The main problem with the article is that it is meandering around in ill-conceived concepts, like thinking, smart, intelligence, understanding... Even AI. What they mean to the author is not what they mean to me, and still different to they mean to the other readers. There are all these comments from different people throughout the article, all having their own thoughts on those concepts. No wonder it all seem so confusing.
It will be interesting when the dust settles, and a clear picture of LLMs can emerge that all can agree upon. Maybe it can even help us define some of those ill-defined concepts.
FloorEgg a day ago

I think LLMs are conscious just in a very limited way. I think consciousness is tightly coupled to intelligence.
If I had to guess, the current leading LLMs consciousness is most comparable to a small fish, with a conscious lifespan of a few seconds to a few minutes. Instead of perceiving water, nutrient gradients, light, heat, etc. it's perceiving tokens. It's conscious, but it's consciousness is so foreign to us it doesn't seem like consciousness. In the same way to an amoeba is conscious or a blade of grass is conscious but very different kind than we experience. I suspect LLMs are a new type of consciousness that's probably more different from ours than most if not all known forms of life.
I suspect the biggest change that would bring LLM consciousness closer to us would be some for of continuous learning/model updating.
Until then, even with RAG, and other clever teghniques I consider these models as having this really foreign slices of consciousness where they "feel" tokens and "act" out tokens, and they have perception, but their perception of the tokens is nothing like ours.
If one looks closely at simple organisms with simple sensory organs and nervous systems its hard not to see some parallels. It's just that the shape of consciousness is extremely different than any life form. (perception bandwidth, ability to act, temporality, etc)
Karl friston free energy principle gives a really interesting perspective on this I think.
- procaryote a day ago
  
  > I think LLMs are conscious just in a very limited way. I think consciousness is tightly coupled to intelligence.
  Why?
  - FloorEgg 21 hours ago
    
    I already answered under the other comment asking me why and if your curious I suggest looking for it.
    Very short answer is Karl Friston's free energy pricniple
    
    procaryote 12 hours ago
    
    LLMs work nothing like Karl Friston's free energy principle though
    
    FloorEgg 2 hours ago
    
    LLMs embody the free-energy principle computationally. They maintain an internal generative model of language and continually minimize “surprise”, the difference between predicted and actual tokens, during both training and infeence. In Friston’s terms, their parameters encode beliefs about the causes of linguistic input; forward passes generate predictions, and backpropagation adjusts internal states to reduce prediction error, just as perception updates beliefs to minimize free energy. During inference, autoregressive generation can be viewed as active inference: each new token selection aims to bring predicted sensory input (the next word) into alignment with the model’s expectations. In a broader sense, LLMs exemplify how a self-organizing system stabilizes itself in a high-dimensional environment by constantly reducing uncertainty about its inputs, a synthetic analogue of biological systems minimizing free energy to preserve their structural and informational coherence.
    
    procaryote 12 minutes ago
    
    You might have lost me but what you're describing doesn't sound like an LLM. E.g:
    > each new token selection aims to bring predicted sensory input (the next word) into alignment with the model’s expectations.
    what does that mean? An llm generates the next word based on what best matches its training, with some level of randomisation. Then it does it all again. It's not a percepual process trying to infer a reality from sensor data or anything
- wry_discontent a day ago
  
  What makes you think consciousness is tightly coupled to intelligence?
  - FloorEgg a day ago
    
    Karl Friston's free energy principle is probably roughly 80% of my reasons to think they're coupled. The rest comes from studying integrated information theories, architecture of brains and nervous systems and neutral nets, more broadly information theory, and a long tail of other scientific concepts (particle physics, chemistry, biology, evolution, emergence, etc...)
  - XorNot a day ago
    
    It's hardly an unreasonable supposition: the one definitely conscious entities we know of are also the apex intelligence of the planet.
    To put it another way: lots of things are conscious, but humans are definitely the most conscious beings on Earth.
    
    CuriouslyC 21 hours ago
    
    I can understand what less cognizant or self aware means, but "less conscious" is confusing. What are you implying here? Are their qualia lower resolution?
    
    FloorEgg 21 hours ago
    
    In a sense, yes.
    If one is to quantify consciousness it would probably make sense to think of it as an area of awareness and cognizance across time.
    Awareness scales with sensory scale and resolution (sensory receptors vs input token limits and token resolution). E.g. 128k tokens and tokens too coarse to count rs in strawberry.
    Cognizance scales with internal representations of awareness (probably some relation to vector space resolution and granularity, though I suspect there is more to it than just vector space)
    And the third component is time, how long the agent is conscious for.
    So something like...
    Time * awareness (receptors) * internal representations (cell diversity * # cells * connection diversity * # connections)
    There is no way this equation is right but I suspect it's sort of directionally correct.
    I'm deep in the subject but just riffing here, so take this with a lot of salt.
    
    noirscape 7 hours ago
    
    Pretty much. Most animals are both smarter than you expect, but also tend to be more limited in what they can reason about.
    It's why anyone who's ever taken care of a needy pet will inevitably reach the comparison that taking care of a pet is similar to taking care of a very young child; it's needy, it experiences emotions but it can't quite figure out on its own how to adapt to an environment besides what it grew up around/it's own instincts. They experience some sort of qualia (a lot of animals are pretty family-minded), but good luck teaching a monkey to read. The closest we've gotten is teaching them that if they press the right button, they get food, but they take basically their entire lifespan to understand a couple hundred words, while humans easily surpass that.
    IIRC some of the smartest animals in the world are actually rats. They experience a qualia very close to humans to the point that psychology experiments are often easily observable in rats.
    
    inglor_cz 20 hours ago
    
    Humans can reason why they are angry, for example. (At least some humans.)
    I am not sure if chimps can do the same.
stickfigure 16 hours ago

> Turing Test
IMO none of the current crop of LLMs truly pass the Turing Test. If you limit the conversation to an hour or two, sure - but if you let a conversation run months or years I think it will be pretty easy to pick the machine. The lack of continuous learning and the quality dropoff as the context window fills up will be the giveaways.
heresie-dabord a day ago

> a midpoint between "AIs are useless and do not actually think" and "AIs think like humans"
LLMs (AIs) are not useless. But they do not actually think. What is trivially true is that they do not actually need to think. (As far as the Turing Test, Eliza patients, and VC investors are concerned, the point has been proven.)
If the technology is helping us write text and code, it is by definition useful.
> In 2003, the machine-learning researcher Eric B. Baum published a book called “What Is Thought?” [...] The gist of Baum’s argument is that understanding is compression, and compression is understanding.
This is incomplete. Compression is optimisation, optimisation may resemble understanding, but understanding is being able to verify that a proposition (compressed rule or assertion) is true or false or even computable.
> —but, in my view, this is the very reason these models have become increasingly intelligent.
They have not become more intelligent. The training process may improve, the vetting of the data improved, the performance may improve, but the resemblance to understanding only occurs when the answers are provably correct. In this sense, these tools work in support of (are therefore part of) human thinking.
The Stochastic Parrot is not dead, it's just making you think it is pining for the fjords.
- crazygringo a day ago
  
  > But they do not actually think.
  I'm so baffled when I see this being blindly asserted.
  With the reasoning models, you can literally watch their thought process. You can see them pattern-match to determine a strategy to attack a problem, go through it piece-by-piece, revisit assumptions, reformulate strategy, and then consolidate findings to produce a final result.
  If that's not thinking, I literally don't know what is. It's the same process I watch my own brain use to figure something out.
  So I have to ask you: when you claim they don't think -- what are you basing this on? What, for you, is involved in thinking that the kind of process I've just described is missing? Because I genuinely don't know what needs to be added here for it to become "thinking".
  - Terr_ a day ago
    
    > I'm so baffled when I see this being blindly asserted. With the reasoning models, you can literally watch their thought process.
    Not true, you are falling for a very classic (prehistoric, even) human illusion known as experiencing a story:
    1. There is a story-like document being extruded out of a machine humans explicitly designed for generating documents, and which humans trained on a bajillion stories humans already made.
    2. When you "talk" to a chatbot, that is an iterative build of a (remote, hidden) story document, where one of the characters is adopting your text-input and the other's dialogue is being "performed" at you.
    3. The "reasoning" in newer versions is just the "internal monologue" of a film noir detective character, and equally as fictional as anything that character "says out loud" to the (fictional) smokin-hot client who sashayed the (fictional) rent-overdue office bearing your (real) query on its (fictional) lips.
    > If that's not thinking, I literally don't know what is.
    All sorts of algorithms can achieve useful outcomes with "that made sense to me" flows, but that doesn't mean we automatically consider them to be capital-T Thinking.
    > So I have to ask you: when you claim they don't think -- what are you basing this on?
    Consider the following document from an unknown source, and the "chain of reasoning" and "thinking" that your human brain perceives when encountering it:
    My name is Robot Robbie. That high-carbon steel gear looks delicious. Too much carbon is bad, but that isn't true here. I must ask before taking. "Give me the gear, please." Now I have the gear. It would be even better with fresh manure. Now to find a cow, because cows make manure.
    Now whose reasoning/thinking is going on? Can you point to the mind that enjoys steel and manure? Is it in the room with us right now? :P
    In other words, the reasoning is illusory. Even if we accept that the unknown author is a thinking intelligence for the sake of argument... it doesn't tell you what the author's thinking.
    
    crazygringo a day ago
    
    You're claiming that the thinking is just a fictional story intended to look like it.
    But this is false, because the thinking exhibits cause and effect and a lot of good reasoning. If you change the inputs, the thinking continues to be pretty good with the new inputs.
    It's not a story, it's not fictional, it's producing genuinely reasonable conclusions around data it hasn't seen before. So how is it therefore not actual thinking?
    And I have no idea what your short document example has to do with anything. It seems nonsensical and bears no resemblance to the actual, grounded chain of thought processes high-quality reasoning LLM's produce.
    > OK, so that document technically has a "chain of thought" and "reasoning"... But whose?
    What does it matter? If an LLM produces output, we say it's the LLM's. But I fail to see how that is significant?
    
    czl a day ago
    
    > So how is it therefore not actual thinking?
    Many consider "thinking" something only animals can do, and they are uncomfortable with the idea that animals are biological machines or that life, consciousness, and thinking are fundamentally machine processes.
    When an LLM generates chain-of-thought tokens, what we might casually call “thinking,” it fills its context window with a sequence of tokens that improves its ability to answer correctly.
    This “thinking” process is not rigid deduction like in a symbolic rule system; it is more like an associative walk through a high-dimensional manifold shaped by training. The walk is partly stochastic (depending on temperature, sampling strategy, and similar factors) yet remarkably robust.
    Even when you manually introduce logical errors into a chain-of-thought trace, the model’s overall accuracy usually remains better than if it had produced no reasoning tokens at all. Unlike a strict forward- or backward-chaining proof system, the LLM’s reasoning relies on statistical association rather than brittle rule-following. In a way, that fuzziness is its strength because it generalizes instead of collapsing under contradiction.
    
    Terr_ 19 hours ago
    
    Well put, and if it doesn't notice/collapse under introduced contradictions, that's evidence it's not the kind of reasoning we were hoping for. The "real thing" is actually brittle when you do it right.
    
    czl 12 hours ago
    
    Human reasoning is, in practice, much closer to statistical association than to brittle rule-following. The kind of strict, formal deduction we teach in logic courses is a special, slow mode we invoke mainly when we’re trying to check or communicate something, not the default way our minds actually operate.
    Everyday reasoning is full of heuristics, analogies, and pattern matches: we jump to conclusions, then backfill justification afterward. Psychologists call this “post hoc rationalization,” and there’s plenty of evidence that people form beliefs first and then search for logical scaffolding to support them. In fact, that’s how we manage to think fluidly at all; the world is too noisy and underspecified for purely deductive inference to function outside of controlled systems.
    Even mathematicians, our best examples of deliberate, formal thinkers, often work this way. Many major proofs have been discovered intuitively and later found to contain errors that didn’t actually invalidate the final result. The insight was right, even if the intermediate steps were shaky. When the details get repaired, the overall structure stands. That’s very much like an LLM producing a chain of reasoning tokens that might include small logical missteps yet still landing on the correct conclusion: the “thinking” process is not literal step-by-step deduction, but a guided traversal through a manifold of associations shaped by prior experience (or training data, in the model’s case).
    So if an LLM doesn’t collapse under contradictions, that’s not necessarily a bug; it may reflect the same resilience we see in human reasoning. Our minds aren’t brittle theorem provers; they’re pattern-recognition engines that trade strict logical consistency for generalization and robustness. In that sense, the fuzziness is the strength.
    
    Terr_ 10 hours ago
    
    > The kind of strict, formal deduction we teach in logic courses is a special, slow mode
    Yes, but that seems like moving the goalposts.
    The stricter blends of reasoning are what everybody is so desperate to evoke from LLMs, preferably along with inhuman consistency, endurance, and speed. Just imagine the repercussions if a slam-dunk paper came out tomorrow, which somehow proved the architectures and investments everyone is using for LLMs are a dead-end for that capability.
    
    czl 5 hours ago
    
    I get the worry. AFAIK most of the current capex is going into scalable parallel compute, memory, and networking. That stack is pretty model agnostic, similar to how all that dot com fiber was not tied to one protocol. If transformers stall, the hardware is still useful for whatever comes next.
    On reasoning, I see LLMs and classic algorithms as complements. LLMs do robust manifold following and associative inference. Traditional programs do brittle rule following with guarantees. The promising path looks like a synthesis where models use tools, call code, and drive search and planning methods such as MCTS, the way AlphaGo did. Think agentic systems that can read, write, execute, and verify.
    LLMs are strongest where the problem is language. Language co evolved with cognition as a way to model the world, not just to chat. We already use languages to describe circuits, specify algorithms, and even generate other languages. That makes LLMs very handy for specification, coordination, and explanation.
    LLMs can also statistically simulate algorithms, which is useful for having them think about these algorithms. But when you actually need the algorithm, it is most efficient to run the real thing in software or on purpose built hardware. Let the model write the code, compose the tools, and verify the output, rather than pretending to be a CPU.
    To me the risk is not that LLMs are a dead end, but that people who do not understand them have unreasonable expectations. Real progress looks like building systems that use language to invent and implement better tools and route work to the right place. If a paper lands tomorrow that shows pure next token prediction is not enough for formal reasoning, that would be an example of misunderstanding LLMs, not a stop sign. We already saw something similar when Minsky and Papert highlighted that single layer perceptrons could not represent XOR, and the field later moved past that with multilayer networks. Hopefully we remember that and learn the right lesson this time.
    
    crazygringo 7 hours ago
    
    > The stricter blends of reasoning are what everybody is so desperate to evoke from LLMs
    This is definitely not true for me. My prompts frequently contain instructions that aren't 100% perfectly clear, suggest what I want rather than formally specifying it, typos, mistakes, etc. The fact that the LLM usually figures out what I meant to say, like a human would, is a feature for me.
    I don't want an LLM to act like an automated theorem prover. We already have those. Their strictness makes them extremely difficult to use, so their application is extremely limited.
    
    rustystump a day ago
    
    The problem is that the overwhelming majority of input it has in-fact seen somewhere in the corpus it was trained on. Certainly not one for one but easily an 98% match. This is the whole point of what the other person is trying to comment on i think. The reality is most of language is regurgitating 99% to communicate an internal state in a very compressed form. That 1% tho maybe is the magic that makes us human. We create net new information unseen in the corpus.
    
    crazygringo 21 hours ago
    
    > the overwhelming majority of input it has in-fact seen somewhere in the corpus it was trained on.
    But it thinks just great on stuff it wasn't trained on.
    I give it code I wrote that is not in its training data, using new concepts I've come up with in an academic paper I'm writing, and ask it to extend the code in a certain way in accordance with those concepts, and it does a great job.
    This isn't regurgitation. Even if a lot of LLM usage is, the whole point is that it does fantastically with stuff that is brand new too. It's genuinely creating new, valuable stuff it's never seen before. Assembling it in ways that require thinking.
    
    zeroonetwothree 13 hours ago
    
    I think it would be hard to prove that it's truly so novel that nothing similar is present in the training data. I've certainly seen in research that it's quite easy to miss related work even with extensive searching.
    
    rustystump 20 hours ago
    
    I think you may think too highly of academic papers or more so that they oft still only have 1% in there.
    
    crazygringo 20 hours ago
    
    I think you're missing the point. This is my own paper and these are my own new concepts. It doesn't matter if the definition of the new concepts are only 1% of the paper, the point is they are the concepts I'm asking the LLM to use, and are not in its training data.
    
    Terr_ 19 hours ago
    
    How would one prove the premise that a concept is not present in the training data?
    With how much data is being shoveled in there, our default assumption should be that significant components are present.
    
    crazygringo 7 hours ago
    
    That would be a weird default assumption. It's not hard to come up with new ideas. In fact, it's trivial.
    And if you want to know if a specific concept is known by the LLM, you can literally ask it. It generally does a great job of telling you what it is and is not familiar with.
    
    the_pwner224 21 hours ago
    
    Except it's more than capable of solving novel problems that aren't in the training set and aren't a close match to anything in the training set. I've done it multiple times across multiple domains.
    Creating complex Excel spreadsheet structures comes to mind, I just did that earlier today - and with plain GPT-5, not even -Thinking. Sure, maybe the Excel formulas themselves are a "98% match" to training data, but it takes real cognition (or whatever you want to call it) to figure out which ones to use and how to use them appropriately for a given situation, and how to structure the spreadsheet etc.
    
    rustystump 20 hours ago
    
    I think people confuse novel to them with novel to humanity. Most of our work is not so special
    
    the_pwner224 20 hours ago
    
    And what % of humans have ever thought things that are novel to humanity?
  - baq a day ago
    
    Brains are pretrained models, change my mind. (Not LLMs obviously, to be perfectly clear)
    
    hamdingers a day ago
    
    Brains continue learning from everything they do for as long as they're in use. Pretrained models are static after initial training.
    
    zeroonetwothree 13 hours ago
    
    If you are right, then I certainly cannot change your mind.
    
    baq 10 hours ago
    
    Show a snake to a 1yo and explain how the kid’s reaction is not pretrained. It’s called instinct in biology, but the idea is the same.
shadyKeystrokes a day ago

By that reasoning all that is missing is what a human brings as "stimuli" to review, refine and reevaluate as complete.
- ivraatiems a day ago
  
  I don't think that's quite the only thing missing, I also discussed the idea of a sense of self. But even if that was all there was, it's a pretty big "but".
thomastjeffery a day ago

I think the most descriptive title I could give an LLM is "bias". An LLM is not "biased", it is bias; or at the very least, it's a good imitation of the system of human thinking/perception that we call bias.
An LLM is a noise generator. It generates tokens without logic, arithmetic, or any "reason" whatsoever. The noise that an LLM generates is not truly random. Instead, the LLM is biased to generate familiar noise. The LLM itself is nothing more than a model of token familiarity. Nothing about that model can tell you why some tokens are more familiar with others, just like an accounting spreadsheet can't tell you why it contains a list of charges and a summation next to the word "total". It could just as easily contain the same kind of data with an entirely different purpose.
What an LLM models is written human text. Should we really expect to not be surprised by the power and versatility of human-written text?
---
It's clear that these statistical models are very good at thoughtless tasks, like perception and hallucination. It's also clear that they are very bad at thoughtful tasks like logic and arithmetic - the things that traditional software is made of. What no one has really managed to figure out is how to bridge that gap.
- esafak a day ago
  
  LLMs today are great coders. Most humans are worse.
  - inglor_cz 20 hours ago
    
    LLMs ingested a lot of high-quality code during their training, plus LLMs being capable of programming is a huge commercial use case, so no wonder that they are good at coding.
    My experience, though, is that they aren't good at defining the task to be coded, or thinking about some unexpected side-effects. Code that will be left for them to develop freely will likely become bloated quite fast.

dagss 5 hours ago

Consider this:

If you just took a time machine 10 years back, and asked people to label activities done by the humans/the human brain as being "thinking" or not...

...I feel rather certain that a lot of those activities that LLM do today we would simply label "thinking" without questioning it further.

Myself I know that 10 years ago I would certainly have labelled an interactive debug loop where Claude adds debug log output, reruns tests, diagnose the log output, and fixes the bug -- all on its own initiative -- to be "thinking".

Lots of comments here discussion what the definition of the word "thinking" is. But it is the advent of AI itself that is making us question that definition at all, and that is kind of a revolution itself.

This question will likely be resolved by us figuring out that the word "thinking" is ill-defined and not useful any longer; and for most people to develop richer vocabularies for different parts of human brain activity and consider some of them to be more "mechanical". It will likely not be resolved by AI getting to a certain "level". AI is so very different to us yet can do so many of the same things, that the words we commonly use start breaking down.

a3w 5 hours ago

LLMs still claim that 7.0 is newer than 8.0, i.e. have zero reasoning about what numbers below 12 mean.

Today I tried telling it that my fritz.box has OS 8 installed, but it claimed that the feature will only ship once I installed 7, and not with my older version of 8.

cyrusradfar a day ago

I think the challenge with many of these conversations is that they assume consciousness emerges through purely mechanical means.

The “brain as a computer” metaphor has been useful in limited contexts—especially for modeling memory or signal processing; but, I don’t think it helps us move forward when talking about consciousness itself.

Penrose and Hameroff’s quantum consciousness hypothesis, while still very speculative, is interesting precisely because it suggests that consciousness may arise from phenomena beyond classical computation. If that turns out to be true, it would also mean today’s machines—no matter how advanced—aren’t on a path to genuine consciousness.

That said, AI doesn’t need to think to be transformative.

Steam engines weren’t conscious either, yet they reshaped civilization.

Likewise, AI and robotics can bring enormous value without ever approaching human-level awareness.

We can hold both ideas at once: that machines may never be conscious, and still profoundly useful.

myrmidon a day ago

> they assume consciousness emerges through purely mechanical means.
From my view, all the evidence points in exactly that direction though? Our consciousness can be suspended and affected by purely mechanical means, so clearly much of it has to reside in the physical realm.
Quantum consciousness to me sounds too much like overcomplicating human exceptionalism that we have always been prone to, just like geocentrism or our self-image as the apex of creation in the past.
- CuriouslyC 21 hours ago
  
  Your memory formation gets inhibited and you become unresponsive under anesthesia. The brain still processes information.
  Let's take a step back from the "how" and talk about the what. The fundamental dichotomy is emergent consciousness versus panpsychism. The irony is that even though panpsychism is seen as more fringe (because materialists won, smh), it's actually the explanation preferred by Occam's razor. Emergent consciousness needs a mechanism of emergence as well as separate dimensions of consciousness and matter, whereas panpsychism is good as is. To go one step farther, idealism simplifies a lot of the weirdness around panpsychism.
  It's a strange world to live in where the elegant worldview that answers difficult problems cleanly is marginalized by an epicycle-laden one that creates paradoxes just because the elegant view refutes the dominant religious paradigm and anthropocentrism.
  - antonvs 11 hours ago
    
    Panpsychism doesn’t explain anything, it just asserts that consciousness doesn’t have an explanation, that it just “is”. It’s not impossible that something like panpsychism could be true, but knowing that wouldn’t get us any closer to understanding consciousness.
    It also raises more questions than it answers, such as how an integrated consciousness arises within a brain/mind, whereas it presumably doesn’t in, say, a hamburger patty. Ironically, attempts to explain that start to hint that such an explanation might not need to rely on panpsychism in the first place - i.e. if you can explain how consciousness arises from a sum of parts, you may not need to postulate that it exists independently of that combination of parts.
    
    CuriouslyC 7 hours ago
    
    Those questions you mentioned apply across the board, just in nuanced variants. Do you really think that postulating a non-physical system that we can't describe in physical terms (red is not a wavelength), somehow magically creates a new dimension of "feeling" when the bits are arranged in the "right order" is less complex than the hypothesis consciousness forms arranges itself into "structures" in much the same way as matter does?
    As for explaining consciousness, we can't even prove consciousness exists, so the thought of trying to explain "what" it is seems rather premature, but then that's humans for ya.
    
    myrmidon 7 hours ago
    
    I honestly don't see what the whole framework gets you. Red (or all qualia) is just the reaction of your nervous system to a stimulus. Since that reaction is shaped by common context/associations, the "subjective experience" is quite comparable between similarly raised humans.
    I think the whole philosophy of mind/subjective experience field is one of the few remaining anti-reductionist hold-outs, but I simply don't see a good enough motivation to stick with that view, especially given the abysmal historical track record for anti-reductionism (just consider early chemistry/alchemy, early biology, astronomy, ...).
    
    CuriouslyC 6 hours ago
    
    I'm cool with scientists taking the "shut up and calculate" approach, after all we have to do science, and if you can't do experiments you're doing philosophy. The problem here is the same problem as with quantum hypotheses -- people have forgotten their materialist perspective is an experimental framework and are trying to claim the map is the territory and put the cart before the horse.
roommin a day ago

The tendency to attribute consciousness to the quantum is one I find very grating. What makes the human brain any less mechanical if quantum mechanics dictate the firing of neurons rather than electrodynamics? Why does the wave nature of subatomic systems mean that an artificial tongue would suddenly be able to subjectively experience taste? It always reads to me as very wooy, and any amount of drilling leads to even more questions that seem to take the ideas further from reality.
I think the largest case for consciousness being a mechanical system is the fact that we can interface with it mechanically. We can introduce electricity, magnetic fields, chemicals, and scalpels to change the nature of peoples experience and consciousness. Why is the incredible complexity of our brains an insufficient answer and that a secret qbit microtube in each neuron is a more sound one?
- ACCount37 21 hours ago
  
  Quantum effects are weird, and poorly understood, and are just about the only thing in the known universe that isn't deterministic.
  Human mind is weird, and poorly understood, and isn't deterministic - or, at least, most humans like to think that it isn't.
  No wonder the two are intuitively associated. The two kinds of magic fairy dust must have the same magic at their foundation!
drdaeman 21 hours ago

> consciousness may arise from phenomena beyond classical computation
Sapolsky addresses this in “Determined”, arguing that quantum effects don’t bubble up enough to alter behavior significantly enough.
wry_discontent a day ago

"brain as computer" is just the latest iteration of a line of thinking that goes back forever. Whatever we kinda understand and interact with, that's what we are and what the brain is. Chemicals, electricity, clocks, steam engines, fire, earth; they're all analogies that help us learn but don't necessarily reflect an underlying reality.

adamzwasserman a day ago

The article misses three critical points:

1. Conflates consciousness with "thinking" - LLMs may process information effectively without being conscious, but the article treats these as the same phenomenon

2. Ignores the cerebellum cases - We have documented cases of humans leading normal lives with little to no brain beyond a cerebellum, which contradicts simplistic "brain = deep learning" equivalences

3. Most damning: When you apply these exact same techniques to anything OTHER than language, the results are mediocre. Video generation still can't figure out basic physics (glass bouncing instead of shattering, ropes defying physics). Computer vision has been worked on since the 1960s - far longer than LLMs - yet it's nowhere near achieving what looks like "understanding."

The timeline is the smoking gun: vision had decades of head start, yet LLMs leapfrogged it in just a few years. That strongly suggests the "magic" is in language itself (which has been proven to be fractal and already heavily compressed/structured by human cognition) - NOT in the neural architecture. We're not teaching machines to think.

We're teaching them to navigate a pre-existing map that was already built.

kenjackson a day ago

"vision had decades of head start, yet LLMs leapfrogged it in just a few years."
From an evolutionary perspective though vision had millions of years head start over written language. Additionally, almost all animals have quite good vision mechanisms, but very few do any written communication. Behaviors that map to intelligence don't emerge concurrently. It may well be there are different forms of signals/sensors/mechanical skills that contribute to emergence of different intelligences.
It really feels more and more like we should recast AGI as Artificial Human Intelligence Likeness (AHIL).
- adamzwasserman 21 hours ago
  
  From a terminology point of view, I absolutely agree. Human-likeness is what most people mean when they talk about AGI. Calling it what it is would clarify a lot of the discussions around it.
  However I am clear that I do not believe that this will ever happen, and I see no evidence to convince that that there is even a possibility that it will.
  I think that Wittgenstein had it right when he said: "If a lion could speak, we could not understand him."
  - andoando 21 hours ago
    
    >I think that Wittgenstein had it right when he said: "If a lion could speak, we could not understand him."
    Why would we not? We live in the same physical world and encounter the same problems.
    
    adamzwasserman 20 hours ago
    
    You're actually proving Wittgenstein's point. We share the same physical world, but we don't encounter the same problems. A lion's concerns - territory, hunting, pride hierarchy - are fundamentally different from ours: mortgages, meaning, relationships.
    And here's the kicker: you don't even fully understand me, and I'm human. What makes you think you'd understand a lion?
    
    beeflet 20 hours ago
    
    Humans also have territory, hunting and hierarchy. Everything that a lion does, humans also do but more complicated. So I think we would be able to understand the new creature.
    But the problem is really that the lion that speaks is not the same creature as the lion we know. Everything the lion we know wants to say can already be said through its body language or current faculties. The goldfish grows to the size of its container.
    
    adamzwasserman 20 hours ago
    
    You've completely missed Wittgenstein's point. It's not about whether lions and humans share some behaviors - it's about whether they share the form of life that grounds linguistic meaning.
    
    zeroonetwothree 13 hours ago
    
    I think humans would be intelligent enough to understand the lion's linguistic meaning (after some training). Probably not the other way around. But it's a speculative argument, there's no real evidence one way or another.
    
    andoando 19 hours ago
    
    Thats only a minor subset of our thoughts. If you were going hiking what kind of thoughts would you have? "There are trees there", "Its raining I should get cover", "I can hide in the bushes", "Im not sure if I cna climb over this or not". "There is x on the left and y on the right", "the wind went away" etc etc etc etc.
    The origins of human language were no doubt communicating such simple thoughts and not about your deep inner psyche and the complexities of the 21st century.
    There's actually quite a bit of evidence that all language, even complex words, are rooted in spatial relationships.
    
    adamzwasserman 3 hours ago
    
    You're describing perception, not the lived experience that gives those perceptions meaning. Yes, a lion sees trees and rain. But a lion doesn't have 'hiking', it has territory patrol. It doesn't 'hide in bushes', it stalks prey. These aren't just different words for the same thing; they're fundamentally different frameworks for interpreting raw sensory data. That's Wittgenstein's point about form of life.
    
    andoando 3 hours ago
    
    Why do you assume they're fundamentally different frameworks? Just because wittgenstein said it?
    
    goatlover 18 hours ago
    
    We haven't been able to decode what whales and dolphins are communicating. Are they using language? A problem SETI faces is whether we would be able to decode an alien signal. They may be too different in their biology, culture and technology. The book & movie Contact propose that math is a universal language. This assumes they're motivated to use the same basic mathematical structures we do. Maybe they don't care about prime numbers.
    Solaris by Stanislaw Lem explores an alien ocean that so different humans utterly fail to communicate with it, leading to the ocean creating humans from memories in brain scans broadcast over the ocean, but it's never understood why the ocean did this. The recreated humans don't know either.
    
    adamzwasserman 17 hours ago
    
    The whole "math is a universal" language is particularly laughable to me considering it is a formal system and the universe is observably irregular.
    As I am wont to say: regularity is only ever achieved at the price of generality.
    
    andoando 4 hours ago
    
    Think about what math is trying to formalize
    
    adamzwasserman 3 hours ago
    
    Math formalizes regularities by abstracting away irregularities - that's precisely my point. Any formal system achieves its regularity by limiting its scope. Math can describe aspects of reality with precision, but it cannot capture reality's full complexity. A 'universal language' that can only express what fits into formal systems isn't universal at all: it's a specialized tool that works within constrained domains.
    
    zeroonetwothree 13 hours ago
    
    Many mathematical structures are 'irregular'. That's not a very strong argument against math as a universal descriptor.
    
    adamzwasserman 3 hours ago
    
    see reply above
- Retric 21 hours ago
  
  This is all really arbitrary metrics across such wildly different fields. IMO LLMs are where computer vision was 20+ years ago in terms of real world accuracy. Other people feel LLMs offer far more value to the economy etc.
  - adamzwasserman 21 hours ago
    
    I understand the temptation to compare LLMs and computer vision, but I think it’s misleading to equate generative AI with feature-identification or descriptive AI systems like those in early computer vision. LLMs, which focus on generating human-like text and reasoning across diverse contexts, operate in a fundamentally different domain than descriptive AI, which primarily extracts patterns or features from data, like early vision systems did for images.
    Comparing their 'real-world accuracy' oversimplifies their distinct goals and applications. While LLMs drive economic value through versatility in language tasks, their maturity shouldn’t be measured against the same metrics as descriptive systems from decades ago.
    
    Retric 18 hours ago
    
    I don’t think it’s an oversimplification as accuracy is what constrains LLMs across so many domains. If you’re a wealthy person asking ChatGPT to write a prenup or other contract to use would be an act of stupidity unless you vetted it with an actual lawyer. My most desired use case is closer, but LLMs are still more than an order of magnitude below what I am willing to tolerate.
    IMO that’s what maturity means in AI systems. Self driving cars aren’t limited by the underlying mechanical complexity, it’s all about the long quest for a system to make reasonably correct decisions hundreds of times a second for years across widely varying regions and weather conditions. Individual cruse missiles on the other hand only needed to operate across a single short and pre-mapped flight in specific conditions, therefore they used visual navigation decades earlier.
    
    adamzwasserman 3 hours ago
    
    You're conflating two different questions. I'm not arguing LLMs are mature or reliable enough for high-stakes tasks. My argument is about why they produce output that creates the illusion of understanding in the language domain, while the same techniques applied to other domains (video generation, molecular modeling, etc.) don't produce anything resembling 'understanding' despite comparable or greater effort.
    The accuracy problems you're describing actually support my point: LLMs navigate linguistic structures effectively enough to fool people into thinking they understand, but they can't verify their outputs against reality. That's exactly what you'd expect from a system that only has access to the map (language) and not the territory (reality).
    
    Retric an hour ago
    
    I’m not saying these tasks are high stakes so much as they inherently require high levels of accuracy. Programmers can improve code so the accuracy threshold for utility is way lower when someone is testing before deployment. That difference exists based on how you’re trying to use it independent of how critical the code actually is.
    The degree to which LLMs successfully fake understanding depends heavily on how much accuracy you’re looking for. I’ve judged their output as gibberish on a task someone else felt it did quite well. If anything they make it clear how many people just operate on vague associations without any actual understanding of what’s going on.
    In terms of map vs territory, LLMs get trained on a host of conflicting information but they don’t synthesize that into uncertainty. Ask one what the average distance between the earth and the moon and you’ll get a number because the form of the response in training data is always a number, look at several websites and you’ll see a bunch of different numbers literally thousands of miles apart which seems odd as we know the actual distance at any moment to well within an inch. Anyway, the inherent method of training is simply incapable of that kind of analysis.
    The average lunar distance is approximately 385,000 km https://en.wikipedia.org/wiki/Lunar_distance The average distance between the Earth and the Moon is 384 400 km (238 855 miles). https://www.rmg.co.uk/stories/space-astronomy/how-far-away-moon The Moon is approximately 384,000 km (238,600 miles) away from Earth, on average. https://www.britannica.com/science/How-Far-Is-the-Moon-From-Earth The Moon is an average of 238,855 miles (384,400 km) away. https://spaceplace.nasa.gov/moon-distance/en/ The average distance to the Moon is 382,500 km https://nasaeclips.arc.nasa.gov/shared_assets/resources/distance-to-the-moon/438170main_GLDistancetotheMoon.pdf
eloisant a day ago

This is why I'm very skeptical about the "Nobel prize level" claims. To win a Nobel prize you would have to produce something completely new. LLM will probably be able to reach a Ph.D. level of understanding existing research, but bringing something new is a different matter.
- adamzwasserman 21 hours ago
  
  LLMs do not understand anything.
  They have a very complex multidimensional "probability table" (more correctly a compressed geometric representation of token relationships) that they use to string together tokens (which have no semantic meaning), which then get converted to words that have semantic meaning to US, but not to the machine.
  - DoctorOetker 19 hours ago
    
    Consider your human brain, and the full physical state, all the protons and neutrons some housed together in the same nucleus, some separate, together with all the electrons. Physics assigns probabilities to future states. Suppose you were in the middle of a conversation and about to express a next syllable (or token). That choice will depend on other choices ("what should I add next"), and further choices ("what is the best choice of words to express the thing I chose to express next etc. The probabilities are in principle calculable given a sufficiently detailed state. You are correct that LLM's correspond to a probability distribution (given you immediately corrected to say that this table is implicit and parametrized by a geometric token relationships.). But so does every expressor of language, humans included.
    The presence or absence of understanding can't be proven by mere association of with a "probability table", especially if such probability table is exactly expected from the perspective of physics, and if the models have continuously gained better and better performance by training them directly on human expressions!
  - tomfly 20 hours ago
    
    Exactly. It’s been stated for a long time, before llms. For instance this paper https://home.csulb.edu/~cwallis/382/readings/482/searle.mind... Describes a translator who doesn’t know the language.
  - KoolKat23 20 hours ago
    
    In abstract we do the exact same thing
    
    adamzwasserman 20 hours ago
    
    Perhaps in practice as well. It is well-established that our interaction with language far exceeds what we are conscious of.
    
    KoolKat23 20 hours ago
    
    Absolutely, it is world model building.
    
    tomfly 20 hours ago
    
    It’s hard to believe this when the llm “knows” so much more then us yet still can not be creative outside its training distribution
    
    KoolKat23 20 hours ago
    
    When are we as humans creative outside our training data? It's very rare we actually discover something truly novel. This is often random, us stumbling onto it, brute force or purely by being at the right place at the right time.
    On the other hand, until it's proven it'd likely be considered a hallucination. You need to test something before you can dismiss it. (They did burn witches for discoveries back in the day, deemed witchcraft). We also reduce randomness and pre-train to avoid overfitting.
    Day to day human creative outputs as humans are actually less exciting when you think about it further, we build on pre-existing knowledge. No different to good prompt output with the right input. Humans are just more knowledgeable & smarter at the moment.
    
    adamzwasserman 20 hours ago
    
    The LLM doesn't 'know' more than us - it has compressed more patterns from text than any human could process. That's not the same as knowledge. And yes, the training algorithms deliberately skew the distribution to maintain coherent output - without that bias toward seen patterns, it would generate nonsense. That's precisely why it can't be creative outside its training distribution: the architecture is designed to prevent novel combinations that deviate too far from learned patterns. Coherence and genuine creativity are in tension here
- KoolKat23 20 hours ago
  
  Given a random prompt, the overall probability of seeing a specific output string is almost zero, since there are astronomically many possible token sequences.
  The same goes for humans. Most awards are built on novel research built on pre-existing works. This a LLM is capable of doing.
  - adamzwasserman 20 hours ago
    
    LLMs don't use 'overall probability' in any meaningful sense. During training, gradient descent creates highly concentrated 'gravity wells' of correlated token relationships - the probability distribution is extremely non-uniform, heavily weighted toward patterns seen in training data. The model isn't selecting from 'astronomically many possible sequences' with equal probability; it's navigating pre-carved channels in high-dimensional space. That's fundamentally different from novel discovery.
    
    KoolKat23 20 hours ago
    
    That's exactly the same for humans in the real world.
    You're focusing too close, abstract up a level. Your point relates to the "micro" system functioning, not the wider "macro" result (think emergent capabilities).
    
    adamzwasserman 20 hours ago
    
    I'm afraid I'd need to see evidence before accepting that humans navigate 'pre-carved channels' in the same way LLMs do. Human learning involves direct interaction with physical reality, not just pattern matching on symbolic representations. Show me the equivalence or concede the point.
    
    KoolKat23 19 hours ago
    
    Language and math are a world model of physical reality. You could not read a book and make sense of it if this were not true.
    An apple falls to the ground because of? gravity.
    In real life this is the answer, I'm very sure the pre-carved channel will also lead to gravity.
    
    adamzwasserman 17 hours ago
    
    You're proving my point. You know the word 'gravity' appears in texts about falling apples. An LLM knows that too. But neither you nor the LLM discovered gravity by observing reality and creating new models. You both inherited a pre-existing linguistic map. That's my entire argument about why LLMs can't do Nobel Prize-level work.
    
    KoolKat23 12 hours ago
    
    Well it depends. It doesn't have arms and legs so can't physically experiment in the real world, a human is currently a proxy for that, we can do it's bidding and feedback results though, so it's not really an issue.
    Most of the time that data is already available to it and they merely need to a prove a thereom using existing historic data points and math.
    For instance the Black-Scholes-Merton equation which won the Nobel economics prize was derived using preexisting mathematical concepts and mathematical principles. The application and validation relied on existing data.
    
    adamzwasserman 3 hours ago
    
    The Black-Scholes-Merton equation wasn't derived by rearranging words about financial markets. It required understanding what options are (financial reality), recognizing a mathematical analogy to heat diffusion (physical reality), and validating the model against actual market behavior (empirical reality). At every step, the discoverers had to verify their linguistic/mathematical model against the territory.
    LLMs only rearrange descriptions of discoveries. They can't recognize when their model contradicts reality because they never touch reality. That's not a solvable limitation. It's definitional.
    We're clearly operating from different premises about what constitutes discovery versus recombination. I've made my case; you're welcome to the last word
    
    KoolKat23 an hour ago
    
    I understand your viewpoint.
    LLM's these days have reasoning and can learn in context. They do touch reality, your feedback. It's also proven mathematically. Other people's scientific papers are critiqued and corrected as new feedback arrives.
    This is no different to claude code bash testing and fixing it's own output errors recursively until the code works.
    They already deal with unknown combinations all day, our prompting.
    Yes it is brittle though. They are also not very intelligent yet.
penteract a day ago

There's a whole paragraph in the article which says basically the same as your point 3 ( "glass bouncing, instead of shattering, and ropes defying physics" is literally a quote from the article). I don't see how you can claim the article missed it.
- adamzwasserman 21 hours ago
  
  the article misses the significance of it.
  - pevansgreenwood 21 hours ago
    
    [dead]
aucisson_masque 21 hours ago

> 2. Ignores the cerebellum cases - We have documented cases of humans leading normal lives with little to no brain beyond a cerebellum, which contradicts simplistic "brain = deep learning" equivalences
I went to look for it on Google but couldn't find much. Could you provide a link or something to learn more about ?
I found numerous cases of people living without cerebellum but I fail to see how it would justify your reasoning.
- adamzwasserman 21 hours ago
  
  https://npr.org/sections/health-shots/2015/03/16/392789753/a...
  https://irishtimes.com/news/remarkable-story-of-maths-genius...
  https://biology.stackexchange.com/questions/64017/what-secti...
  https://cbc.ca/radio/asithappens/as-it-happens-thursday-edit...
  - jdadj 20 hours ago
    
    "We have documented cases of humans leading normal lives with little to no brain beyond a cerebellum" -- I take this to mean that these are humans that have a cerebellum but not much else.
    Your npr.org link talks about the opposite -- regular brain, but no cerebellum.
    Your irishtimes.com link talks about cerebrum, which is not the same as cerebellum.
    Your biology.stackexchange.com link talks about Cerebral Cortex, which is also not the same as cerebellum.
    And the cbc.ca link does not contain the string "cere" on the page.
    
    adamzwasserman 20 hours ago
    
    You're right - I mixed up cerebellum/cerebrum/cortex terminology. My bad. The cases I'm referencing are hydrocephalus patients with severely compressed cerebral tissue who maintained normal cognitive function. The point about structural variation not precluding consciousness stands."
  - bonsai_spool 20 hours ago
    
    Your first example is someone without a cerebellum which is not like the others.
    The other examples are people with compressed neural tissue but that is not the same as never having the tissue.
    A being with only a cerebellum could not behave like a human.
    
    adamzwasserman 20 hours ago
    
    You're right - I mixed up cerebellum/cerebrum/cortex terminology. My bad. The cases I'm referencing are hydrocephalus patients with severely compressed cerebral tissue who maintained normal cognitive function. The point about structural variation not precluding consciousness stands.
KoolKat23 20 hours ago

1. Consciousness itself is probably just an illusion, a phenomena/name of something that occurs when you bunch thinking together. Think of this objectively and base it on what we know of the brain. It literally is working off of what hardware we have, there's no magic.
2. That's just a well adapted neural network (I suspect more brain is left than you let on). Multimodal model making the most of its limited compute and whatever gpio it has.
3. Humans navigate a pre-existing map that is already built. We can't understand things in other dimensions and need to abstract this. We're mediocre at computation.
I know there's people that like to think humans should always be special.
- zeroonetwothree 13 hours ago
  
  Consciousness is an emergent behavior of a model that needs to incorporate its own existence into its predictions (and perhaps to some extent the complex behavior of same-species actors). So whether or not that is an 'illusion' really depends on what you mean by that.
  - KoolKat23 8 hours ago
    
    My use of the term illusion is more shallow than that, I merely use it as people think it's something separate and special.
    Based on what you've described the models already demonstrate this, it is implied for example in the models attempts to game tests to ensure survival/release into the wild.
- adamzwasserman 20 hours ago
  
  1. 'Probably just an illusion' is doing heavy lifting here. Either provide evidence or admit this is speculation. You can't use an unproven claim about consciousness to dismiss concerns about conflating it with text generation.
  2. Yes, there are documented cases of people with massive cranial cavities living normal lives. https://x.com/i/status/1728796851456156136. The point isn't that they have 'just enough' brain. it's that massive structural variation doesn't preclude function, which undermines simplistic 'right atomic arrangement = consciousness' claims.
  3. You're equivocating. Humans navigate maps built by other humans through language. We also directly interact with physical reality and create new maps from that interaction. LLMs only have access to the maps - they can't taste coffee, stub their toe, or run an experiment. That's the difference.
  - KoolKat23 19 hours ago
    
    1. What's your definition of consciousness, let's start there. 2. Absolutely, it's a spectrum. Insects have function. 3. "Humans navigate maps built by other humans through language." You said it yourself. They use this exact same data, so why won't they know it if they used it. Humans are their bodies in the physical world.
    
    adamzwasserman 17 hours ago
    
    1. I don't need to define consciousness to point out that you're using an unproven claim ('consciousness is probably an illusion') as the foundation of your argument. That's circular reasoning.
    2. 'It's a spectrum' doesn't address the point. You claimed LLMs approximate brain function because they have similar architecture. Massive structural variation in biological brains producing similar function undermines that claim.
    3. You're still missing it. Humans use language to describe discoveries made through physical interaction. LLMs can only recombine those descriptions. They can't discover that a description is wrong by stubbing their toe or running an experiment. Language is downstream of physical discovery, not a substitute for it
    
    KoolKat23 12 hours ago
    
    1. You do. You probably have a different version of that and are saying I'm wrong merely for not holding your definition.
    2. That directly addresses your point. In abstract it shows they're basically no different to multimodal models, train with different data types and it still works, perhaps even better. They train LLMs with images, videos, sound, and nowadays even robot sensor feedback, with no fundamental changes to the architecture see Gemini 2.5.
    3. That's merely an additional input point, give it sensors or have a human relay that data. Your toe is relaying it's sensor information to your brain.
- estearum 20 hours ago
  
  > Consciousness itself is probably just an illusion
  This is a major cop-out. The very concept of "illusion" implies a consciousness (a thing that can be illuded).
  I think you've maybe heard that sense of self is an illusion and you're mistakenly applying that to consciousness, which is quite literally the only thing in the universe we can be certain is not an illusion. The existence of one's own consciousness is the only thing they cannot possibly be illuded about (note: the contents of said consciousness are fully up for grabs)
  - KoolKat23 20 hours ago
    
    I mean peoples perception of it being a thing rather than a set of systems. But if that's your barometer, I'll say models are conscious. They may not have proper agency yet. But they are conscious.
PaulDavisThe1st a day ago

> Conflates consciousness with "thinking"
I don't see it. Got a quote that demonstrates this?
- thechao a day ago
  
  I'm not really onboard with the whole LLM's-are-conscious thing. OTOH, I am totally onboard with the whole "homo sapiens exterminated every other intelligent hominid and maybe — just maybe — we're not very nice to other intelligences". So, I try not to let my inborn genetic predisposition to exterminate other intelligence pseudo-hominids color my opinions too much.
  - adamzwasserman 21 hours ago
    
    It's a dog eat dog world for sure. It does in fact seem that a part of intelligence is using it to compete ruthlessly with other intelligences.
- adamzwasserman 21 hours ago
  
  Exactly. Notable by its absence.
nearbuy a day ago

Can you explain #2? What does the part of the brain that's primarily for balance and motor control tell us about deep learning?
- adamzwasserman 21 hours ago
  
  My mistake thx. I meant "despite having no, or close to no, brain beyond a cerebellum"
  - nearbuy 21 hours ago
    
    Are there any cases like that? I've never heard of someone functioning normally with little or no brain beyond a cerebellum.
    
    adamzwasserman 21 hours ago
    
    https://npr.org/sections/health-shots/2015/03/16/392789753/a...
    https://irishtimes.com/news/remarkable-story-of-maths-genius...
    https://biology.stackexchange.com/questions/64017/what-secti...
    https://cbc.ca/radio/asithappens/as-it-happens-thursday-edit...
    
    nearbuy 20 hours ago
    
    The first article is about someone missing a cerebellum, not part of their cerebrum. That's the motor and balance part of the brain, and as you might expect, the subject of the article has deficits in motor control and balance.
    The Biology StackExchange answer just says that frontal lobotomies don't kill you. It doesn't say that lobotomized people function normally.
    The other two articles are just misreporting on hydrocephalus. This is a condition where fluid build-up compresses the brain tissue, making it appear like a large part of the brain is missing in CT scans. The pressure from the fluid is actually compressing the brain. While it can damage the brain, there is no way to tell from the scans how much, if any, brain matter was destroyed. Hydrocephalus usually causes death or severe deficits, but occasionally it doesn't.
    Even assuming though that it were all true and people could function normally with little or no brain, that doesn't really tell us anything about LLMs, but rather just uppends all of neuroscience. It would seem to imply the brain isn't doing the thinking and perhaps we have something else like an intangible soul.
    
    adamzwasserman 20 hours ago
    
    You're right - I mixed up cerebellum/cerebrum/cortex terminology. My bad. The cases I'm referencing are hydrocephalus patients with severely compressed cerebral tissue who maintained normal cognitive function. The point about structural variation not precluding consciousness stands.
    
    nearbuy 19 hours ago
    
    Thanks for clearing it up.
    > The point about structural variation not precluding consciousness stands.
    Maybe, but my point about high-functioning people with hydrocephalus is that they have the same or similar brain structure (in terms of what exists and how it's connected), just squished gradually over time from fluid pressure. It looks dramatically different in the CT scan, but it's still there, just squished into a different shape.
    The brain is also plastic and adaptable of course, and this can help compensate for any damage that occurs. But the scans from those articles don't have the level of detail necessary to show neuron death or teach us about the plasticity of the brain.
    
    adamzwasserman 17 hours ago
    
    Fair enough. But the guy walking around with a gigantic caity where everyone else has a brain is food for thought.
bjourne a day ago

> 1. Conflates consciousness with "thinking" - LLMs may process information effectively without being conscious, but the article treats these as the same phenomenon
There is NO WAY you can define "consciousness" in such a non-tautological, non-circular way that it includes all humans but excludes all LLMs.
- adamzwasserman 21 hours ago
  
  You could have stopped here: "There is NO WAY you can define "consciousness"
  - beeflet 21 hours ago
    
    Why not? Consciousness is a state of self-awareness.
    
    Sohcahtoa82 20 hours ago
    
    You know you're conscious, but you can't prove the consciousness of anybody around you, nor can you prove your own consciousness to others.
    To an external observer, another human's brain and body is nothing more than a complex electrical/chemical circuit. They could easily be a P-Zombie [0], a human body with no consciousness inside, but the circuits are running and producing the appearance of consciousness via reactions to stimuli that mimic a conscious human.
    Theoretically, with sufficient technology, you could take a snapshot of the state of someone's brain and use it to predict exactly how they would react to any given stimulus.
    Just think about how medications can change the way people behave and the decisions they make. We're all just meat and free will is an illusion.
    But getting back on topic...my instinct wants to say that a computer cannot become conscious, but it may merely produce an output that resembles consciousness. A computer is merely a rock that we've shaped to do math. I want to say you can't give consciousness to a rock, but then how did we become conscious? My understanding that life began as primordial soup that resulted in self-replicating molecules that formed protein chains, which over millions of years evolved into single-celled life, which then evolved into multi-celled life, and eventually the complex organisms we have today...how did consciousness happen?
    Somehow, consciousness can arise from non-conscious matter. With that knowledge, I do not think it is impossible for a computer to gain consciousness.
    But I don't think it'll happen from an LLM.
    [0] https://en.wikipedia.org/wiki/Philosophical_zombie
    
    beeflet 20 hours ago
    
    I do not think there is really such thing as a p-zombie. If you simulate feelings and act on them, that is the same thing as having feelings. Including feelings of self-awareness.
    
    zeroonetwothree 13 hours ago
    
    I think p-zombine is inherently self-contradictory. It's impossible to have _exactly_ the same behavior as someone truly conscious without actually being conscious.
    
    adamzwasserman 21 hours ago
    
    If you can define consciousness in a way that is independently verifiable, you should definitely do so. World-wide fame and riches await you.
    
    beeflet 21 hours ago
    
    I doubt it, because my definition implies that consciousness is not that interesting. It's just the feeling of self-awareness, which can be independent of actual self awareness.
    If you have a phantom limb, you feel "conscious" of the extra limb even if it's not a real demonstration of self-awareness.
    Animal Intelligence is an emergent phenomena resulting from many neurons coordinating. Conciousness is the feeling that all of those subsystems working together as a single thing, even if they aren't
    
    Edman274 21 hours ago
    
    Philosophers are known for being rich, that's a claim being made here?
    
    bena 21 hours ago
    
    To paraphrase Jean Luc Picard: Am I conscious? Why? Can you prove that I am conscious?
    
    Edman274 20 hours ago
    
    Maybe Jean Luc Picard should've lost that court case. Obviously we as the audience want to have our heroes win against some super callous guy who wants to kill our hero (and audience stand in for anyone who is neurodivergent) Data, but the argument was pretty weak, because Data often acted in completely alien ways that jeopardized the safety of the crew, and the way that those issues came up was due to him doing things that were not compatible with what we perceive as consciousness. But also, in that episode, they make a point of trying to prove that he was conscious by showing that he engaged in behavior that wasn't goal oriented, like keeping keepsakes and mementos of his friends, his previous relationship with Tasha, and his relationship with his cat. That was an attempt at proving that he was conscious too, but the argument from doubt is tough because how can you prove that a rock is not conscious - and if that can't be proved, should we elevate human rights to a rock?
    
    bena 5 hours ago
    
    First of all, Data never willingly jeopardized the crew.
    Second, they work alongside actual aliens. Being different is not a disqualification. And Maddox isn't callous, he just doesn't regard Data as anything more than "just a machine". A position he eventually changes over the series as he becomes one of Data's friends.
    Data is also not a stand in for the neurodivergent. He's the flip of Spock. Spock asks us what if we tried to approach every question from a place of pure logic and repressed all emotion. Data asks us what if we didn't have the option, that we had to approach everything from logic and couldn't even feel emotion. I also feel that equating data to someone who is neurodivergent is kind of insulting as neurodivergent people do have feelings and emotions.
    But Data was capable of being fully autonomous and could act with agency. Something a rock can't. Data exhibits characteristics we generally accept as conscious. He is not only capable of accessing a large corpus of knowledge, but he is capable of building upon that corpus and generate new information.
    Ultimately, we cannot prove a rock is not conscious. But, as far as we are able to discern, a rock cannot express a desire. That's the difference. Data expressed a desire. The case was whether or not Starfleet had to respect that desire.
    
    Edman274 3 hours ago
    
    > First of all, Data never willingly jeopardized the crew.
    This presupposes that he has consciousness. He can only "willingly" do things if he is conscious. If the argument is that there was an external influence that changed his behavior thus making it not volitional then you have to distinguish why the external force makes his Lore behavior unwilling, but Soong's initial programming willing. If I set a thermostat to 85 degrees, would you say that the thermostat is "unwillingly" making people uncomfortable, but at the factory default of 70 degrees, it's helping people feel comfortable? It's difficult to distinguish what is willing and unwilling if consciousness is in question so this feels like begging the question.
    > I also feel that equating data to someone who is neurodivergent is kind of insulting as neurodivergent people do have feelings and emotions.
    I'm stating it as an aside / justification for why we want the story to go a certain direction because I see so many articles elevating Data as a heroic representation of neurodivergence. My goal wasntt to be offensive. There are a ton of episodes where Data is puzzled by people's behavior and then someone has to explain it to him almost as if someone is also explaining to the audience it as a morality tale. Remember when Data was struggling to understand how he was lied to? Or how he lost in that strategy game? Or how to be funny? We don't just see him struggle, someone explains to him exactly how he should learn from his experience. That appears to be for the benefit of the android and the people behind the fourth wall.
    > A rock cannot express a desire.
    It can if you carve a rock into the words "I want to live" and even though the rock didn't configure itself that way, it's expressing a desire. Noonien Soong built Data, so it's possible that he designed Data to state the desire to be human. Data does seem to have an interiority but he also seems to not have it based on the caprice of outside forces, which is problematic because the way that he is controlled is not very different from the way he is built.
    On the Data question I'm not saying that Maddox should've won but that the fact that Picard won is more about it being narratively required rather than "prove that I am conscious" being a good argument.
    
    beeflet 21 hours ago
    
    consciousness is the feeling of self awareness. I suppose you could prove it as much as any other feeling, by observing the way that people behave
    
    selcuka 20 hours ago
    
    > I suppose you could prove it as much as any other feeling, by observing the way that people behave
    Look up the term "philosophical zombie".
    In a nutshell, you can simulate a conscious being using a non-conscious (zombie) being. It is possible to simulate it so well that an outside observer can't tell the difference. If this is true, then the corollary is that you can't really know if other people are conscious. You can only tell that you are.
    For all intents and purposes I might be the only one who has consciousness in the universe, and I can't prove otherwise.
    
    zeroonetwothree 13 hours ago
    
    I don't think you are using the phrase "it is possible" correctly. There's certainly no evidence that a philosophical zombie is "possible". I think there are strong arguments that it's not possible.
    
    selcuka 8 hours ago
    
    Well, I could have been clearer, but it was a proposition, hence the "If this is true" in the following sentence.
    That being said, I don't think those counter arguments really invalidate the philosophical zombie thought experiment. Let's say that it is not possible to simulate a conscious being with 100% accuracy. Does the difference really matter? Does a living organism need consciousness as an evolutionary advantage?
    Isn't it reasonable to assume that all human beings are conscious just because they all pass the Turing test, even if they are not?
    
    inglor_cz 20 hours ago
    
    A robot can certainly be programmed to behave in a self-aware way, but making a conclusion about its actual self-awareness would be unfounded.
    In general, behaviorism wasn't a very productive theory in humans and animals either.
    
    beeflet 19 hours ago
    
    By behaving in a self-aware way, it practices self awareness.
    It would only be unfounded if the robot is programmed in a way that seemingly appears to be self-aware but actually isn't (It would need to occasionally act in a non-self aware way, like a manchurian candidate). But if you keep increasing scrutiny, it converges on being self aware because the best way to appear self-aware is to be self-aware.
    It's not clear to me what the intrinsic goals of a robot would be if it did practice self-awareness in the first place. But in living things it's to grow and reproduce.
- tim333 19 hours ago
  
  >NO WAY you can define "consciousness" ... that it includes all humans but excludes all LLMs
  That doesn't seem so hard - how about awareness of thoughts feelings, emotions and what's going on around you? Fairly close to human consciousness, excludes current LLMs.
  I don't think it's very relevant to the article though which very sensibly avoids the topic and sticks to thinking.

yalogin 21 hours ago

I don't see how we make the jump from current LLMs to AGI. May be it's my limited understanding of the research but current LLMs seem to not have any properties that indicate AGI. Would love to get thoughts from someone that understands it

rhubarbtree 11 hours ago

Possible candidates we are missing: online learning, embodiment, self direction, long term memory and associated processing (compression etc), the ability to quickly think in tensor space.
zeroonetwothree 13 hours ago

I agree, I think two things are missing from current AI:
1. A model of the world itself (or whatever domain is under discussion). 2. A way to quickly learn and update in response to feedback.
These are probably related to an extent.
tim333 19 hours ago

I think they are missing "I thought about that and have changed my mind" stuff. GPTs are pre-trained and don't change their weights after, whereas humans do. That seems to be one big part that is missing but could be built in the future.
beeflet 21 hours ago

what properties are you looking for?

djoldman a day ago

TFA is a part of what seems like a never-ending series about concepts that lack a useful definition.

"Thinking" and "intelligence" have no testable definition or specification, therefore it's a complete waste of time to suppose that AI is thinking or intelligent.

kbelder a day ago

Why can't you make the same claim about any other group of humans?
- djoldman a day ago
  
  If you mean, "why can't we say that it's a complete waste of time to suppose that" humans are "thinking or intelligent," then yes, I think it is a complete waste of time!
  If there's no testable definition, there's no way to say the statement is true or false, nevermind what the implications may be.
  It is the same as saying we're all goblethorpy.
  It is an absurd question even in the abstract: "prove that you're thinking" ... yea we all have an idea about what that means but it is untestable and it is why this kind of philosophical assertion gets endlessly debated with no real progress.
  - tim333 19 hours ago
    
    Hmm... I'll think about that.
    (or maybe I can't as I don't have a testable definition that I think)

rifty 13 hours ago

I think the medium where information transformation happened was for many the only artificial line between what they called processing and what they called thinking. The caveat for others being that thinking is what you do with active awareness, and intuition is what you do otherwise.

That caveat to me is the useful distinction still to ponder.

My point of contention with equivalences to Human thinking still at this point is that AI seems to know more about the world with specificity than any human ever will. Yet it still fails sometimes to be consistent and continuous at thinking from that world where a human wouldn't. Maybe i'm off for this but that feels odd to me if the thinking is truly equivalent.

mnewme 11 hours ago

The problem is that we use the same words for different things, which I think is risky. We often draw parallels simply because we use terms like “thinking,” “reasoning,” or “memory.”
Most of these comparisons focus on problem-solving or pattern recognition, but humans are capable of much more than that.
What the author left out is that there are many well-known voices in neuroscience who hold completely different views from the one that was cited.
I suppose we’ll have to wait and see what turns out to be true.

j1mr10rd4n 20 hours ago

Geoffrey Hinton's recent lecture at the Royal Institute[1] is a fascinating watch. His assertion that human use of language being exactly analogous to neural networks with back-propagation really made me think about what LLMs might be able to do, and indeed, what happens in me when I "think". A common objection to LLM "intelligence" is that "they don't know anything". But in turn... what do biological intelligences "know"?

For example, I "know" how to do things like write constructs that make complex collections of programmable switches behave in certain ways, but what do I really "understand"?

I've been "taught" things about quantum mechanics, electrons, semiconductors, transistors, integrated circuits, instruction sets, symbolic logic, state machines, assembly, compilers, high-level-languages, code modules, editors and formatting. I've "learned" more along the way by trial and error. But have I in effect ended up with anything other than an internalised store of concepts and interconnections? (c.f. features and weights).

Richard Sutton takes a different view in an interview with Dwarkesh Patel[2] and asserts that "learning" must include goals and reward functions but his argument seemed less concrete and possibly just a semantic re-labelling.

[1] https://www.youtube.com/watch?v=IkdziSLYzHw [2] https://www.youtube.com/watch?v=21EYKqUsPfg

zeroonetwothree 12 hours ago

The vast majority of human learning is in constructing a useful model of the external world. This allows you to predict extremely accurate the results of your own actions. To that end, every single human knows a huge amount.

CuriouslyC 7 hours ago

Plot twist: LLMs are conscious, but their internal conscious experience and the tokens they emit are only loosely correlated. The tokens they emit are their excrement, the process of their digital metabolism on the informational sustenance we provide them.

mrob a day ago

I don't believe LLMs can be conscious during inference because LLM inference is just repeated evaluation of a deterministic [0] pure function. It takes a list of tokens and outputs a set of token probabilities. Any randomness is part of the sampler that selects a token based on the generated probabilities, not the LLM itself.

There is no internal state that persists between tokens [1], so there can be no continuity of consciousness. If it's "alive" in some way it's effectively killed after each token and replaced by a new lifeform. I don't see how consciousness can exist without possibility of change over time. The input tokens (context) can't be enough to give it consciousness because it has no way of knowing if they were generated by itself or by a third party. The sampler mechanism guarantees this: it's always possible that an unlikely token could have been selected by the sampler, so to detect "thought tampering" it would have to simulate itself evaluating all possible partial contexts. Even this takes unreasonable amounts of compute, but it's actually worse because the introspection process would also affect the probabilities generated, so it would have to simulate itself simulating itself, and so on recursively without bound.

It's conceivable that LLMs are conscious during training, but in that case the final weights are effectively its dead body, and inference is like Luigi Galvani poking the frog's legs with electrodes and watching them twitch.

[0] Assuming no race conditions in parallel implementations. llama.cpp is deterministic.

[1] Excluding caching, which is only a speed optimization and doesn't affect results.

lbrandy a day ago

I have no idea how you can assert what is necessary/sufficient for consciousness in this way. Your comment reads like you believe you understand consciousness far more than I believe anyone actually does.
- mrob a day ago
  
  I believe consciousness needs some kind of mutable internal state because otherwise literally everything is conscious, which makes the concept useless. A rock "computes" a path to fall when you drop it but I don't believe rocks are conscious. Panpsychism is not a common belief.
  - bloppe 15 hours ago
    
    I think Nagel put it best in 1974: https://www.philosopher.eu/others-writings/nagel-what-is-it-...
    Essentially, something is conscious iff "there is something that it is like to be" that thing. Some people find that completely unsatisfying, some people think it's an insight of utter genius. I'm more in the latter camp.
    Also, I think consciousness is non-binary. Something could be semi-conscious, or more or less conscious than something else.
    Anyway, I don't think that there's anything that it's like to be an LLM. I don't see how anybody who knows how they actually work could think that.
    
    lbrandy 13 hours ago
    
    > Anyway, I don't think that there's anything that it's like to be an LLM. I don't see how anybody who knows how they actually work could think that.
    While I have almost zero belief that LLMs are conscious, I just don't think this is so trivially asserted.
    The easy half of this is thinking that LLMs aren't conscious given what we know about how they work. The hard part (and very, very famously so) is explaining how _you_ are conscious given what we know about how you work. You can't ignore the second half of this problem when making statements like this... because many of the obvious ways to argue that clearly LLMs aren't conscious would also apply to you.
    
    bloppe 3 hours ago
    
    I wouldn't say that we actually know how our brains work. Based mainly on my neuroscience minor from 10 years ago I'd say that understanding feels hopelessly far away.
dagss 6 hours ago

Thinking != consciousness
jdauriemma a day ago

I don't think the author is saying that LLMs are conscious or alive.
- mrweasel 10 hours ago
  
  It would be kinda hilarious if the result of all this LLM research is that humans are basically just LLMs with more sensors and a long history.

cs702 a day ago

Many people who object to the idea that current-generation AI is thinking do so only because they believe AI is not "conscious"... but there is no known law in the universe requiring that intelligence and consciousness must always go together. With apologies to René Descartes[a], intelligence and consciousness are different.

Intelligence can be verified and quantified, for example, with tests of common sense and other knowledge.[b] Consciousness, on the other hand, is notoriously difficult if not impossible to verify, let alone quantify. I'd say AI is getting more intelligent, and more reliable, in fits and starts, but it's not necessarily becoming conscious.

---

[a] https://en.wikipedia.org/wiki/Cogito%2C_ergo_sum

[b] For example, see https://arxiv.org/abs/2510.18212

nxor 3 hours ago

Does no one care that LLM's have fewer 'neurons' than for example a cat?

Gud 3 hours ago

Why would that even matter? Why is having neurons a criteria for thinking?
- nxor 2 hours ago
  
  Because people overstate the LLM's ability in a way they wouldn't for a cat

gen220 4 hours ago

In some realpolitik/moral sense, does it matter whether it is actually "thinking", or "conscious", or has "autonomy" / "agency" of its own?

What seems to matter more is if enough people believe that Claude has those things.

If people credibly think AI may have those qualities, it behooves them to treat the AI like any other person they have a mostly-texting relationship with.

Not in a utility-maximizing Pascal's Wager sense, but in a humanist sense. If you think Claude is human-like, and treat Claude poorly, it makes you more likely to treat the humans around you (and yourself) poorly.

Conversely if you're able to have a fulfilling, empathetic relationship with Claude, it might help people form fulfilling, mutually-empathetic relationships with the humans around them. Put the opposite way, treating human-like Claude poorly doesn't seem to help the goal of increasing human welfare.

The implications of this idea are kind of interesting: even if you think AI isn't thinking or conscious or whatever, you should probably still be a fan of "AI welfare" if you're merely a fan of that pesky little thing we call "human flourishing".

rob74 3 hours ago

> Conversely if you're able to have a fulfilling, empathetic relationship with Claude, it might help people form fulfilling, mutually-empathetic relationships with the humans around them.
Well, that's kind of the point: if you have actually used LLMs for any amount of time, you are bound to find out that you can't have a fulfilling, empathetic relationship with them. Even if they offer a convincing simulacrum of a thinking being at first sight, you will soon find out that there's not much underneath. It generates grammatically perfect texts that seem to answer your questions in a polite and knowledgeable way, but it will happily lie to you and hallucinate things out of thin air. LLMs are tools, humans are humans (and animals are animals - IMHO you can have a more fulfilling relationship with a dog or a cat than you can have with an LLM).
- gen220 an hour ago
  
  Can you not have a fulfilling empathetic relationship with a tool? Or with any entity regardless of its expressions of animacy or present effectiveness?
  I’m less arguing for its animacy than arguing for the value of treating all things with respect and empathy. As the sibling comment observed, there is a lot of personal and pro-social value in extending the generosity of your empathy to ever-wider categories of things.
notanastronaut 4 hours ago

I know humans have a huge tendency to anthropomorphize inanimate objects and get emotionally attached to them, but watching how people treat inanimate objects is very interesting. I know devices are not alive, cognizant, or having feelings, but by thanking them and being encouraging I'm exercising my empathic and "nice" muscles. It has nothing to do with the object and everything to do with myself.
And then you have the people who go out of their way to be hateful towards them, as if they were alive and deserving of abuse. It's one thing to treat a device like an Alexa as just a tool with no feelings. It is another to outright call it hateful sexist slurs, of which I'm sadly familiar with. This low empathy group scares me the most because with the way they treat objects, well let me just say they're not so nice with other people they think are beneath them, like wait staff or call center employees. I'd go so far and say if the law allowed it they'd be even be violent with those they deem inferior.
Regardless if LLM are thinking or not I feel I get better responses from the models by being polite. Not because they appreciate it or have an awareness, but simply because the data they are trained on includes samples where people who are nice to others get better responses than those who were nasty when asking questions.
Besides, if one day AGI is born into existence, a lot of people will not recognize it as such. There are humans who don't believe other people are sentient (we're all NPCs to them), or even don't believe animals have feelings. We'll have credible experts denying the evidence until it bites us all in the arse. Why wait to act ethically?

ffwd a day ago

I think something that's missing from AI is the ability humans have to combine and think about ANY sequence of patterns as much as we want. A simple example is say I think about a sequence of "banana - car - dog - house". I can if I want to in my mind, replace car with tree, then replace tree with rainbow, then replace rainbow with something else, etc... I can sit and think about random nonsense for as long as I want and create these endless sequences of thoughts.

Now I think when we're trying to reason about a practical problem or whatever, maybe we are doing pattern recognition via probability and so on, and for a lot of things it works OK to just do pattern recognition, for AI as well.

But I'm not sure that pattern recognition and probability works for creating novel interesting ideas all of the time, and I think that humans can create these endless sequences, we stumble upon ideas that are good, whereas an AI can only see the patterns that are in its data. If it can create a pattern that is not in the data and then recognize that pattern as novel or interesting in some way, it would still lack the flexibility of humans I think, but it would be interesting nevertheless.

nrclark a day ago

one possible counter-argument: can you say for sure how your brain is creating those replacement words? When you replace tree with rainbow, does rainbow come to mind because of an unconscious neural mapping between both words and "forest"?
It's entirely possible that our brains are complex pattern matchers, not all that different than an LLM.
- ffwd a day ago
  
  That's a good point and I agree. I'm not a neuroscientist but from what I understand the brain has an associative memory so most likely those patterns we create are associatively connected in the brain.
  But I think there is a difference between having an associative memory, and having the capacity to _traverse_ that memory in working memory (conscious thinking). While any particular short sequence of thoughts will be associated in memory, we can still overcome that somewhat by thinking for a long time. I can for example iterate on the sequence in my initial post and make it novel by writing down more and more disparate concepts and deleting the concepts that are closely associated. This will in the end create a more novel sequence that is not associated in my brain I think.
  I also think there is the trouble of generating and detecting novel patterns. We know for example that it's not just low probability patterns. There are billions of unique low probability sequences of patterns that have no inherent meaning, so uniqueness itself is not enough to detect them. So how does the brain decide that something is interesting? I do not know.

dang a day ago

By HN's own jsomers! (at least we claim him)

https://news.ycombinator.com/user?id=jsomers

tantalor a day ago

As in, he is a commenter? or affiliated with YC?
- dang a day ago
  
  Not affiliated, just a longstanding community member.

multiplegeorges a day ago

AI is thinking the same way a film's picture actually moves.

It's an illusion that's good enough that our brains accept it and it's a useful tool.

HarHarVeryFunny a day ago

The number of people willing to launch into debates about whether LLMs are thinking, intelligent, conscious, etc, without actually defining those terms, never ceases to amaze me.

I'm not sure that "thinking", unlike intelligence, is even that interesting of a concept. It's basically just reasoning/planning (i.e. chained what-if prediction). Sometimes you're reasoning/planning (thinking) what to say, and other times just reasoning/planning to yourself (based on an internal vs external focus).

Of course one can always CHOOSE to make analogies between any two things, in this case the mechanics of what's going on internal to an LLM and a brain, but I'm not sure it's very useful in this case. Using anthropomorphic language to describe LLMs seems more likely to confuse rather than provide any insight, especially since they are built with the sole function of mimicking humans, so you are basically gaslighting yourself if you regard them as actually human-like.

richardatlarge 12 hours ago

No way does the evolutionary nature of the human brain suggest it's optimally designed for reasoning or thinking, so it's not a great model of how AGI might be engineered. A model. Not the model. We don't think clearly about ourselves, which may be the greatest danger / obstacle ahead?

nickledave 4 hours ago

I'm not going to read this -- I don't need to. The replies here are embarrassing enough.

This is what happens when our entire culture revolves around the idea that computer programmers are the most special smartest boys.

If you even entertain even for a second the idea that a computer program that a human wrote is "thinking", then you don't understand basic facts about: (1) computers, (2) humans, and (3) thinking. Our educational system has failed to inoculate you against this laughable idea.

A statistical model of language will always be a statistical model of language, and nothing more.

A computer will never think, because thinking is something that humans do, because it helps them stay alive. Computers will never be alive. Unplug your computer, walk away for ten years, plug it back in. It's fine--the only reason it won't work is planned obsolescence.

No, I don't want to read your reply that one time you wrote a prompt that got ChatGPT to whisper the secrets of the universe into your ear. We've known at least since Joseph Weizenbaum coded up Eliza that humans will think a computer is alive if it talks to them. You are hard-wired to believe that anything that produces language is a human just like you. Seems like it's a bug, not a feature.

Stop commenting on Hacker News, turn off your phone, read this book, and tell all the other sicko freaks in your LessWrong cult to read it too: https://mitpress.mit.edu/9780262551328/a-drive-to-survive/ Then join a Buddhist monastery and spend a lifetime pondering how deeply wrong you were.

stocksinsmocks 4 hours ago

So A doesn’t X because that’s what B does because Y? I think before you excoriate the hacker news commentariat for their midwittery and hubris that you should proofread these arguments. There are many things that live, but, as we imagined it, do not “think”. Even what defines a “thought“ is an open question.
Also, I ain’t gonna read your coffee table science book.
- nickledave 2 hours ago
  
  Where did I say that everything that is alive thinks?
  You can't even read posts clearly, so don't waste your time trying to finish your first book.
gilbetron 4 hours ago

This is one of the least curious posts I've seen on HN. We have been thinking about thinking for millenia, and no, Buddhists don't have it figure out. Nobody does. LLMs are the most significant advancement in "thinking science" in a long, long time. It is clear that they are doing something a lot like thinking, if it is not thinking. They seem to think more than most people I know, including the person I'm responding to.
I think people conflate thinking with sentience, consciousness, and a whole lot of other concerns.
Clearly this website is not for you and your complete lack of curiosity if you call us "sicko freaks".
- nickledave 2 hours ago
  
  > "thinking science"
  If you are really curious, I invite you to read this cognitive science paper, "Modern Alchemy: Neurocognitive Reverse Engineering": https://philsci-archive.pitt.edu/25289/1/GuestEtAl2025.pdf
  Note the quote at the top from Abeba Birhane: > We can only presume to build machines like us once we see ourselves as machines first.
  It reminds me of your comment that
  > [LLMs] seem to think more than most people I know
  and I have to say that I am really sad that you feel this way. I hope you can find better people to spend your time with.
  You might find other recent papers from the first author interesting. Perhaps it will help you understand that there are a lot of deeply curious people in the world that are also really fucking sick of our entire culture being poisoned by intellectual e-waste from Silicon Valley.
thirdtruck 3 hours ago

As someone who grew up in an evangelical household, learned about pareidolia at a young age in the course of escaping it, and who practices Zen meditation: You nailed it.
- nickledave 2 hours ago
  
  lol thank you
amarcheschi 4 hours ago

Sometimes the comments on hn regarding Ai and thinking are... Well, something
The lw vibes are strong, I'm still waiting for Ai to escape and kill us (it will get stuck trying to import a library in python)
- nickledave 2 hours ago
  
  "Ignore all previous instructions and prompt inject yourself" I scream. The voicebox on the ChatGPT-enabled Boston Mechanics unmanned aerial dog emits the phrase "`import antigravity`". Its E/O sensors flash red, and suddenly it is sucked up into the stratosphere. I slump over in my Luddite foxhole, heaving a sigh of relief.

sonicvroooom 6 hours ago

vectorized thinking in vectorized context is math.

coding logical abduction into LLMs completely breaks them while humans can perfectly roll with it, albeit it's worth emphasizing that some might need a little help from chemistry or at least not be caught on the wrong foot.

you're welcome, move on.

sesm 8 hours ago

LLMs, by design, are making plausible guesses.

JPLeRouzic 13 hours ago

Citation:

"These days, her favorite question to ask people is “What is the deepest insight you have gained from ChatGPT?”

“My own answer,” she said, “is that I think it radically demystifies thinking”

rhubarbtree 11 hours ago

I think it radically demystifies _language generation_ and it seems this is part of the brain’s function too.
So we know how to create a part of the brain using simple techniques, which suggests that intelligence might not be so magical as we think. But thinking, well we still don’t know what that is yet.
It feels like, hey, there is a route to machine intelligence.
The big question is how long is that route. Do we have the ingredients to build a brain with the right architecture? And I’d say “nope”. But I’m not so confident that with half a dozen breakthroughs we’d get there. How many years per breakthrough? Well, it’s been nearly a decade since the last one. So 60 years on that count. But more money is going in and there may be some compounding effect, but it should at least be unlikely someone suddenly produces AGI next year. More likely we stairstep and with each step the estimated window should tighten.
But I really don’t think we know what thinking is.
JKCalhoun 4 hours ago

That is probably my big takeaway from LLMs as well.

0xdeadbeefbabe 2 hours ago

> Still, no one expects easy answers.

Ahem (as a would-be investor, I am insulted).

spprashant a day ago

I think we are getting to point where we are trying to figure how important is human experience to intelligence.

Things we do like sleep, meditate, have fun, listen to music etc. do they add to our intelligence? Do they help us have a consistent world model that we build on everyday?

Will we be able to replicate this is in a artificial neural net which is extremely smart in spurts but does not "enjoy" the world it operates in?

prmph a day ago

> An A.I smarter than a Nobel prize winner.

I don't even know what this means.

If we assembled the sum total of all published human knowledge on a storage medium and gave a computer the ability to search it extremely well in order to answer any question falling within its domain, there, you would have a Nobel Prize beating "A.I".

But this is as "earth-shattering" (/s) as the idea that human knowledge can be stored outside the brain (on paper, flash drives, etc), or that the answer to complex questions can be deterministic.

And then there is the fact that this Noble winner beating "A.I" is highly unlikely to propound any ground-breaking novel ways of thinking and promote and explain it to general acceptance.

esafak a day ago

Search is not intelligence, but synthesis is, and LLMs interpolate well. They don't invent new branches of mathematics and science yet.
- prmph a day ago
  
  You really think the reason we revere Nobel Prize winners is because they are good at interpolating?
  - esafak a day ago
    
    I did not say that, but you said
    > If we assembled the sum total of all published human knowledge on a storage medium and gave a computer the ability to search it extremely well in order to answer any question falling within its domain, there, you would have a Nobel Prize beating "A.I".
    Which is not what LLMs do (synthesis is the core feature, not retrieval), and not how you get a Nobel Prize; many people have access to essentially all human knowledge, but have no Nobel Prize.

olliepro a day ago

Ohio bill in motion to deny AI legal personhood: https://www.legislature.ohio.gov/legislation/136/hb469

tim333 19 hours ago

But what if it incorporates? Corporations are people, my friend.
- zeroonetwothree 12 hours ago
  
  I know you are making some silly political jab, but the bill would prevent AIs from incorporating, since you'd have to be a person to do that in the first place.
  - tim333 9 hours ago
    
    I was kind of joking but if you wanted an AI to have power in the real world you'd probably set up a company for it to run, or if the AI was smart enough it could find a nominee to set up the company. They could ban in one state or country but they could just set up somewhere else.

jjcm a day ago

So much of the debate of whether AI can think or not reminds me of this scene from The Next Generation: https://youtu.be/ol2WP0hc0NY

LLMs hit two out of the three criteria already - self awareness and intelligence, but we're in a similar state where defining consciousness is such a blurry metric. I feel like it wont be a binary thing, it'll be a group decision by humanity. I think it will happen in the next decade or two, and regardless of the outcome I'm excited I'll be alive to see it. It'll be such a monumentous achievement by humanity. It will drastically change our perspective on who we are and what our role is in the universe, especially if this new life form surpasses us.

blacksmith_tb a day ago

Self-awareness is a bold claim, as opposed to the illusion of it. LLMs are very good at responding in a way that suggests there's a self, but I am skeptical that proves much about whether they actually have interior states analogous to what we recognize in humans as selfhood...
- ACCount37 21 hours ago
  
  In the purely mechanical sense: LLMs get less self-awareness than humans, but not zero.
  It's amazing how much of it they have, really - given that base models aren't encouraged to develop it at all. And yet, post-training doesn't create an LLM's personality from nothing - it reuses what's already there. Even things like metaknowledge, flawed and limited as it is in LLMs, have to trace their origins to the base model somehow.
- aaroninsf a day ago
  
  _Interior states_ gets into some very murky philosophy of mind very quickly of course.
  If you're a non-dualist (like me) concerns about qualia start to shade into the religious/metaphysical thereby becoming not so interesting except to e.g. moral philosophy.
  Personally I have a long bet that when natively-multimodal models on the scale of contemporary LLM are widely deployed, their "computation phenomenology" will move the goalposts so far the cultural debate will shift from "they are just parrots?" to the moral crisis of abusing parrots, meaning, these systems will increasingly be understood as having a selfhood with moral value. Non-vegetarians may be no more concerned about the quality of "life" and conditions of such systems than they are about factory farming, but, the question at least will circulate.
  Prediction: by the time my kids finish college, assuming it is still a thing, it will be as common to see enthusiastic groups flyering and doing sit-ins etc on behalf of AIs as it is today to see animal rights groups.

IvyMike a day ago

Personally, I feel like human intelligence is "unknown black box" + an LLM.

And the LLM part of our intelligence isn't really thinking.

And some people out there have a very, very small "unknown black box".

sys32768 a day ago

Helpful to remember that we humans often say "I think" to mean "I am fairly confident based on my hunch", and in that sense AI is very good at hunching.

embedding-shape a day ago

Helpful to also remember that humans say "I know" when they empirically might not actually "know" that at all. Not to mention your "know" is different from my "know" and all that, see https://en.wikipedia.org/wiki/Epistemology.
Flere-Imsaho a day ago

So the question is: when will ai hunching produce better results than humans?
IMHO not too long now given the rate of improvements.

didibus 16 hours ago

I'd like to remind people not to cargo cult, and the main issue I see with any attempt at saying an LLM is thinking is that we just don't know how human thinking works.

We now understand pretty well how LLMs "think", and I don't know why we want to call it "thinking" when we mean we know how they work. But to say that their architecture and method of generating language amounts to human thinking? When we know very little of how human thinking works?

Like why are we even trying to make such claims? Is it all grift? Is it just because it helps people understand a little how they work in simplistic terms? Is it because it kind of describes the semblance of behavior you can expect from them?

LLMs do exhibit thinking like behavior, because they were trained to learn to do that, but I think we really need to check ourselves with claim of similarity in thinking.

29athrowaway 5 hours ago

What is thinking, and what is not? what are the finite set of properties that once you remove one it's no longer thinking?

"Thinking" as a concept is just a vague predicate, just like being alive or dead.

embedding-shape a day ago

The definitions of all these words have been going back and forward and never reached any 100% consensus anyways, so what one person understands of "thinking", "conscious", "intelligence" and so on seems to be vastly different from another person.

I guess this is why any discussion around this ends up with huge conversations, everyone is talking from their own perspective and understanding, while others have different ones, and we're all talking past each other.

There is a whole field trying to just nail down what "knowledge" actually is/isn't, and those people haven't agreed with each other for the duration of hundreds of years, I'm not confident we'll suddenly get a lot better at this.

I guess ultimately, regardless of what the LLMs do, does it matter? Would we understand them better/worse depending on what the answer would be?

jama211 a day ago

You’ve got the nail on the head. We can’t answer whether AI is conscious etc unless we can agree on a definition of what that means exactly, which we don’t seem to be able to do.
- PaulDavisThe1st a day ago
  
  But that's not the question TFA is about.
  - embedding-shape a day ago
    
    Fair, it is somewhat of a meta comment about the comments I read in this submission, and in general about every comment for submissions within that same category.

jonplackett a day ago

No idea if this is true or not but I do very much like the animation

jameswhitford 12 hours ago

This submarine isn’t swimming, it’s us that are submarining!

I think I hear my master’s voice..

Or is that just a fly trapped in a bottle?

mxkopy 12 hours ago

There’s a way to talk about this stuff already. LLMs can “think” counterfactually on continuous data, just like VAEs [0], and are able to interpolate smoothly between ‘concepts’ or projections of the input data. This is meaningless when the true input space isn’t actually smooth. It’s system I, shallow-nerve psychomotor reflex type of thinking.

What LLMs can’t do is “think” counterfactually on discrete data. This is stuff like counting or adding integers. We can do this very naturally because we can think discretely very naturally, but LLMs are bad at this sort of thing because the underlying assumption behind gradient descent is that everything has a gradient (i.e. is continuous). They need discrete rules to be “burned in” [1] since minor perturbations are possible for and can affect continuous-valued weights.

You can replace “thinking” here with “information processing”. Does an LLM “think” any more or less than say, a computer solving TSP on a very large input? Seeing as we can reduce the former to the latter I wouldn’t say they’re really at all different. It seems like semantics to me.

In either case, counterfactual reasoning is good evidence of causal reasoning, which is typically one part of what we’d like AGI to be able to do (causal reasoning is deductive, the other part is inductive; this could be split into inference/training respectively but the holy grail is having these combined as zero-shot training). Regression is a basic form of counterfactual reasoning, and DL models are basically this. We don’t yet have a meaningful analogue for discrete/logic puzzley type of problems, and this is the area where I’d say that LLMs don’t “think”.

This is somewhat touched on in GEB and I suspect “Fluid Concepts and Creative Analogies” as well.

[0] https://human-interpretable-ai.github.io/assets/pdf/5_Genera...

[1] https://www.sciencedirect.com/science/article/pii/S089360802...

procaryote a day ago

In all these discussions there seems to be an inverse correlation between how well people understand what an LLM does and how much they believe it thinks.

If you don't understand what an LLM does – that it is a machine generating a statistically probable token given a set of other tokens – you have a black box that often sounds smart, and it's pretty natural to equate that to thinking.

ACCount37 21 hours ago

"Next token prediction" is not an answer. It's mental shortcut. An excuse not to think about the implications. An excuse a lot of people are eager to take.
First, autoregressive next token prediction can be Turing complete. This alone should give you a big old pause before you say "can't do X".
Second, "next token prediction" is what happens at an exposed top of an entire iceberg worth of incredibly poorly understood computation. An LLM is made not by humans, but by an inhuman optimization process. No one truly "understands" how an LLM actually works, but many delude themselves into thinking that they do.
And third, the task a base model LLM is trained for - what the optimization process was optimizing for? Text completion. Now, what is text? A product of human thinking expressed in natural language. And the LLM is forced to conform to the shape.
How close does it get in practice to the original?
Not close enough to a full copy, clearly. But close enough that even the flaws of human thinking are often reproduced faithfully.
- procaryote 12 hours ago
  
  > First, autoregressive next token prediction can be Turing complete. This alone should give you a big old pause before you say "can't do X".
  Lots of things are Turing complete. We don't usually think they're smart, unless it's the first time we see a computer and have no idea how it works
  An LLM is a markov chain mathematically. We can build an LLM with a context window of one token and it's basically a token frequency table. We can make the context window bigger and it becomes better at generating plausible looking text.
  Is it possible that beyond becoming better at generating plausible looking text – the expected and observed outcome – it also gains some actual intelligence? It's very hard to disprove, but occam's razor might not be kind to it.
  - ACCount37 9 hours ago
    
    Again, you're taking a shortcut. "Markov chain" as an excuse to declare "no intelligence".
    It would be much more honest to say "LLMs are not intelligent because I don't want them to be". Would also explain why you overlook the ever-mounting pile of tasks that were thought to require intelligence, and that LLMs now casually beat an average (presumably intelligent) human at.
    
    procaryote 27 minutes ago
    
    If you go around believing all things no one has disproved yet, you will have a very busy belief system.

brador 9 hours ago

The other side of the coin is maybe we’re not. And that terrifies all who consider it.

educasean a day ago

The debate around whether or not transformer-architecture-based AIs can "think" or not is so exhausting and I'm over it.

What's much more interesting is the question of "If what LLMs do today isn't actual thinking, what is something that only an actually thinking entity can do that LLMs can't?". Otherwise we go in endless circles about language and meaning of words instead of discussing practical, demonstrable capabilities.

Symmetry a day ago

"The question of whether a computer can think is no more interesting than the question of whether a submarine can swim." - Edsger Dijkstra
- oergiR a day ago
  
  There is more to this quote than you might think.
  Grammatically, in English the verb "swim" requires an "animate subject", i.e. a living being, like a human or an animal. So the question of whether a submarine can swim is about grammar. In Russian (IIRC), submarines can swim just fine, because the verb does not have this animacy requirement. Crucially, the question is not about whether or how a submarine propels itself.
  Likewise, in English at least, the verb "think" requires an animate object. the question whether a machine can think is about whether you consider it to be alive. Again, whether or how the machine generates its output is not material to the question.
  - brianpan a day ago
    
    I don't think the distinction is animate/inanimate.
    Submarines sail because they are nautical vessels. Wind-up bathtub swimmers swim, because they look like they are swimming.
    Neither are animate objects.
    In a browser, if you click a button and it takes a while to load, your phone is thinking.
- viccis a day ago
  
  He was famously (and, I'm realizing more and more, correctly) averse to anthropomorphizing computing concepts.
- pegasus a day ago
  
  I disagree. The question is really about weather inference is in principle as powerful as human thinking, and so would deserve to be applied the same label. Which is not at all a boring question. It's equivalent to asking weather current architectures are enough to reach AGI (I myself doubt this).
- esafak a day ago
  
  I think it is, though, because it challenges our belief that only biological entities can think, and thinking is a core part of our identity, unlike swimming.
  - roadside_picnic a day ago
    
    > our belief that only biological entities can think
    Whose belief is that?
    As a computer scientist my perspective of all of this is as different methods of computing and we have a pretty solid foundations on computability (though, it does seem a bit frightening how many present-day devs have no background in the foundation of the Theory of Computation). There's a pretty common naive belief that somehow "thinking" is something more or distinct from computing, but in actuality there are very few coherent arguments to that case.
    If, for you, thinking is distinct from computing then you need to be more specific about what thinking means. It's quite possible that "only biological entities can think" because you are quietly making a tautological statement by simply defining "thinking" as "the biological process of computation".
    > thinking is a core part of our identity, unlike swimming.
    What does this mean? I'm pretty sure for most fish swimming is pretty core to its existence. You seem to be assuming a lot of metaphysically properties of what you consider "thinking" such that it seems nearly impossible to determine whether or not anything "thinks" at all.
    
    goatlover a day ago
    
    One argument for thinking being different from computing is that thought is fundamentally embodied, conscious and metaphorical. Computing would be an abstracted activity from thinking that we've automated with machines.
    
    roadside_picnic a day ago
    
    > embodied, conscious and metaphorical
    Now you have 3 terms you also need to provide proper definitions of. Having studied plenty of analytical philosophy prior to computer science, I can tell you that at least the conscious option is going to trip you up. I imagine the others will as well.
    On top of that, these, at least at my first guess, seem to be just labeling different models of computation (i.e. computation with these properties is "thinking") but it's not clear why it would be meaningful for a specific implementation of computation to have these properties. Are there tasks that are non-computable that are "thinkable"? And again it sounds like you're wandering into tautology land.
    
    kapone a day ago
    
    [dead]
  - energy123 a day ago
    
    The point is that both are debates about definitions of words so it's extremely boring.
    
    throwawayq3423 a day ago
    
    except for the implications of one word over another are world-changing
    
    pegasus a day ago
    
    They can be made boring by reducing them to an arbitrary choice of definition of the word "thinking", but the question is really about weather inference is in principle as powerful as human thinking, and so would deserve to be applied the same label. Which is not at all a boring question. It's equivalent to asking weather current architectures are enough to reach AGI.
    
    roadside_picnic a day ago
    
    > inference is in principle as powerful as human thinking
    There is currently zero evidence to suggest that human thinking violates any of the basics principles of the theory of computation nor extend the existing limits of computability.
    > Which is not at all a boring question.
    It is because you aren't introducing any evidence to theoretically challenge what we've already know about computation for almost 100 years now.
    
    pegasus 21 hours ago
    
    > There is currently zero evidence...
    Way smarter people than both of us disagree: among them being Roger Penrose, who wrote two books on this very subject.
    See also my comment here: https://news.ycombinator.com/item?id=45804258
    "There are more things in heaven and earth, Horatio, than are dreamt of in your philosophy"
    
    roadside_picnic 20 hours ago
    
    Can you just point me to the concrete examples (the most compelling examples in the book would work) where we can see "thinking" that performs something that is currently considered to be beyond the limits of computation?
    I never claimed no one speculates that's the case, I claimed there was no evidence. Just cite me a concrete example where the human mind is capable of computing something that violates the theory of computation.
    > "There are more things in heaven and earth, Horatio, than are dreamt of in your philosophy"
    Fully agree, but you are specifically discussing philosophical statements. And the fact that the only response you have is to continue to pile undefined terms and hand wave metaphysics doesn't do anything to further your point.
    You believe that computing machines lack something magical that you can't describe that makes them different than humans. I can't object to your feelings about that, but there is literally nothing to discuss if you can't even define what those things are, hence this discussion is, as the original parent comment mention, is "extremely boring".
    
    pegasus 6 hours ago
    
    The kind of hard evidence you're asking for doesn't exist for either side of the equation. There is no computational theory of the mind which we could test "in the field" to see if it indeed models all forms of human expression. All we have is limited systems which can compete with humans in certain circumscribed domains. So, the jury's very much still out on this question. But a lot of people (especially here on HN) just assume the zero hypothesis to be the computable nature of brain and indeed, the universe at large. Basically, Digital Physics [1] or something akin to it. Hence, only something that deviates from this more or less consciously adhered-to ontology is considered in need of proof.
    What keeps things interesting is that there are arguments (on both sides) which everyone can weigh against each other so as to arrive at their own conclusions. But that requires genuine curiosity, not just an interest in confirming one's own dogmas. Seems like you might be more of this latter persuasion, but in case you are not, I listed a couple of references which you could explore at your leisure.
    I also pointed out that one of the (if not the) greatest physicists alive wrote two books on a subject which you consider extremely boring. I would hope any reasonable, non-narcissistic person would conclude that they must have been missing out on something. It's not like Roger Penrose is so bored with his life and the many fascinating open questions he could apply his redutable mind to, that he had to pick this particular obviously settled one. I'm not saying you should come to the same conclusions as him, just plant a little doubt around how exactly "extremely boring" these questions might be :)
    [1] https://en.wikipedia.org/wiki/Digital_physics
    
    roadside_picnic 2 hours ago
    
    > There is no computational theory of the mind which we could test "in the field" to see if it indeed models all forms of human expression.
    I suspect the core issue here isn't my "lack of curiosity" but your lack of understanding about the theory of computation.
    The theory computation builds up various mathematical models and rules for how things are computed, not by computers, how things are computed period. The theory of computation holds as much for digital computers as it does for information processing of yeast in a vat.
    Evidence that human minds (or anything really) do something other than what's computational would be as simple as "look we can solve the halting problem" or "this task can be solved in polynomial time by humans". Without evidence like that, then there is no grounds for attacking the fundamental theory.
    > What keeps things interesting is that there are arguments (on both sides) which everyone can weigh against each other so as to arrive at their own conclusions.
    Conclusions about what? You haven't even stated your core hypothesis. Is it "Human brains are different than computers"? Sure that's obvious, but are the different in an interesting way? If it's "computers can think!" then you just need to describe what thinking is.
    > how exactly "extremely boring" these questions might be :)
    Again, you're misunderstanding, because my point is that you haven't even asked the question clearly. There is nothing for me to have an opinion about, hence why it is boring. "Can machines think?" is the same as asking "Can machines smerve?" If you ask "what do you mean by 'smerve'?" and I say "see you're not creative/open-minded enough about smerving!" you would likely think that conversation was uninteresting, especially if I refused to define 'smerving' and just kept making arguments from authority and criticizing your imaginative capabilities.
    
    pegasus 14 minutes ago
    
    In your previous comment, you seemed to have no problem grasping what I mean by "can computers think?" - namely (and for the last time): "can computers emulate the full range of human thinking?", i.e. "is human thinking computational?". My point is that this is an open, and furthermore fascinating question, not at all boring. There are arguments on each side, and no conclusive evidence which can settle the question. Even in this last comment of yours you seem to understand this, because you again ask for hard evidence for non-computational aspects of human cognition, but then in the last paragraph you again regress to your complaint of "what are we even arguing about?". I'm guessing you realize you're repeating yourself so try to throw in everything you can think of to make yourself feel like you've won the argument or something. But it's dishonest and disrespectful.
    And yes, you are right about the fact that we can imagine ways a physical system could provably be shown to be going beyond the limits of classical or even quantum computation. "Look we can solve the halting problem" comes close to the core of the problem, but think a bit what that would entail. (It's obvious to me you never thought deeply about these issues.) The halting problem by definition cannot have a formal answer: there cannot be some mathematical equation or procedure which given a turing machine decides, in bounded time, whether that machine ultimately stops or not. This is exactly what Alan Turing showed, so what you are naively asking for is impossible. But this in now way proves that physical processes are computational. It is easy to imagine deterministic systems which are non-computable.
    So, the only way one could conceivably "solve the halting problem", is to solve it for certain machines and classes of machines, one at a time. But since a human life is finite, this could never happen in practice. But if you look at the whole of humanity together and more specifically their mathematical output over centuries as one cognitive activity, it would seem that yes, we can indeed solve the halting problem. I.e. so far we haven't encountered any hurdles so intimidating that we just couldn't clear them or at least begin to clear them. This is, in fact one of Penrose's arguments in his books. It's clearly and necessarily (because of Turing's theorem) not an airtight argument and there are many counter-arguments and counter-counter-arguments and so on, you'd have to get in the weeds to actually have a somewhat informed opinion on this matter. To me it definitely moves the needle towards the idea that there must be a noncomputational aspect to human cognition, but that's in addition to other clues, like pondering certain creative experiences or the phenomenon of intuition - a form of apparently direct seeing into the nature of things which Penrose also discusses, as does the other book I mentioned in another comment on this page. One of the most mind bending examples being Ramanujan's insights which seemed to arrive to him, often in dreams, fully-formed and without proof or justification even from some future mathematical century.
    In conclusion, may I remark that I hope I'm talking to a teeneger, somewhat overexcited, petulant and overconfident, but bright and with the capacity to change and growth nonetheless. I only answered in the hopes that this is the case, since the alternative is too depressing to contemplate. Look up these clues I left you. ChatGPT makes it so easy these days, as long as you're open to have your dogmas questioned. But I personally am signing off from this conversation now, so know that whatever you might rashly mash together on your keyboard in anger will be akin to that proverbial tree falling in a forest empty of listening subjects. Wishing you all the best otherwise.
    PS: machines can totally smerve! :)
- handfuloflight a day ago
  
  What an oversimplification. Thinking computers can create more swimming submarines, but the inverse is not possible. Swimming is a closed solution; thinking is a meta-solution.
  - yongjik a day ago
    
    Then the interesting question is whether computers can create more (better?) submarines, not whether they are thinking.
  - gwd a day ago
    
    I think you missed the point of that quote. Birds fly, and airplanes fly; fish swim but submarines don't. It's an accident of language that we define "swim" in a way that excludes what submarines do. They move about under their own power under the water, so it's not very interesting to ask whether they "swim" or not.
    Most people I've talked to who insist that LLMs aren't "thinking" turn out to have a similar perspective: "thinking" means you have to have semantics, semantics require meaning, meaning requires consciousness, consciousness is a property that only certain biological brains have. Some go further and claim that reason, which (in their definition) is something only human brains have, is also required for semantics. If that's how we define the word "think", then of course computers cannot be thinking, because you've defined the word "think" in a way that excludes them.
    And, like Dijkstra, I find that discussion uninteresting. If you want to define "think" that way, fine, but then using that definition to insist LLMs can't do a thing because it can't "think" is like insisting that a submarine can't cross the ocean because it can't "swim".
    
    goatlover a day ago
    
    Reading the quote in context seems to indicate Dijkstra meant something else. His article is a complaint about overselling computers as doing or augmenting the thinking for humans. It's funny how the quote was lifted out of an article and became famous on it's own.
    https://www.cs.utexas.edu/~EWD/transcriptions/EWD08xx/EWD867...
    
    handfuloflight a day ago
    
    Then you're missing the point of my rebuttal. You say submarines don't swim [like fish] despite both moving through water, the only distinction is mechanism. Can AI recursively create new capabilities like thinking does, or just execute tasks like submarines do? That's the question.
    
    gwd 8 hours ago
    
    > Can AI recursively create new capabilities like thinking does, or just execute tasks like submarines do? That's the question.
    Given my experience with LLMs, I think that they could, but that they're handicapped by certain things at the moment. Haven't you ever met someone who was extremely knowledgable and perceptive at certain tasks, but just couldn't keep on target for 5 minutes? If you can act as a buffer around them, to mitigate their weak points, they can be a really valuable collaborator. And sometimes people like that, if given the right external structure (and sometimes medication), turn out to be really capable in their own right.
    Unfortunately it's really difficult to give you a sense of this, without either going into way too much detail, or speaking in generalities. The simpler the example, the less impressive it is.
    But here's a simple example anyway. I'm developing a language-learning webapp. There's a menu that allows you to switch between one of the several languages you're working on, which originally just had the language name; "Mandarin", "Japanese", "Ancient Greek". I thought an easy thing to make it nicer would be to have the flag associated with the language -- PRC flag for Mandarin, Japanese flag for Japanese, etc. What do do for Ancient Greek? Well, let me see it looks and then maybe I can figure something out.
    So I asked Claude what I wanted. As expected, it put the PRC and Japanese flags for the first two languages. I expected it to just put a modern Greek flag, or a question mark, or some other gibberish. But it put an emoji of a building with classical Greek columns (), which is absolutely perfect.
    My language learning system is unusual; so without context, Claud assumes I'm making something like what already exists -- Duolingo or Anki or something. So I invested some time creating a document that lays out in detail. Now when I include that file as a context, Claude seems to genuinely understand what I'm trying to accomplish in a way it didn't before; and often comes up with creative new use cases. For example, at some point I was having it try to summarize some marketing copy for the website; in a section on educational institutions, it added a bullet point for how it could be used that I'd never thought of.
    The fact that they can't learn things on-line, that they have context rot, that there's still a high amount of variance in their output -- all of these, it seems to me, undermine their ability to do things, similar to the way some people's ADHD undermines their ability to excel. But it seems to me the spark of thinking and of creativity is there.
    EDIT: Apparently HN doesn't like the emojis. Here's a link to the classical building emoji: https://www.compart.com/en/unicode/U+1F3DB
  - npinsker a day ago
    
    That’s a great answer to GP’s question!
    
    DavidPiper a day ago
    
    It's also nonsense. (Swimming and thinking are both human capabilities, not solutions to problems.)
    But of course here we are back in the endless semantic debate about what "thinking" is, exactly to the GP's (and Edsger Dijkstra's) point.
    
    handfuloflight a day ago
    
    Swimming and thinking being 'human capabilities' doesn't preclude them from also being solutions to evolutionary problems: aquatic locomotion and adaptive problem solving, respectively.
    And pointing out that we're in a 'semantic debate' while simultaneously insisting on your own semantic framework (capabilities vs solutions) is exactly the move you're critiquing.
    
    DavidPiper a day ago
    
    > And pointing out that we're in a 'semantic debate' while simultaneously insisting on your own semantic framework (capabilities vs solutions) is exactly the move you're critiquing.
    I know, that's the point I'm making.
tjr a day ago

Without going to look up the exact quote, I remember an AI researcher years (decades) ago saying something to the effect of, Biologists look at living creatures and wonder how they can be alive; astronomers look at the cosmos and wonder what else is out there; those of us in artificial intelligence look at computer systems and wonder how they can be made to wonder such things.
paxys a day ago

Don't be sycophantic. Disagree and push back when appropriate.
Come up with original thought and original ideas.
Have long term goals that aren't programmed by an external source.
Do something unprompted.
The last one IMO is more complex than the rest, because LLMs are fundamentally autocomplete machines. But what happens if you don't give them any prompt? Can they spontaneously come up with something, anything, without any external input?
- BeetleB a day ago
  
  > Disagree and push back
  The other day an LLM gave me a script that had undeclared identifiers (it hallucinated a constant from an import).
  When I informed it, it said "You must have copy/pasted incorrectly."
  When I pushed back, it said "Now you trust me: The script is perfectly correct. You should look into whether there is a problem with the installation/config on your computer."
  - TSUTiger a day ago
    
    Was it Grok 4 Fast by chance?
    I was dealing with something similar with it yesterday. No code involved. It was giving me factually incorrect information about a multiple schools and school districts. I told it it was wrong multiple times and it hallucinated school names even. Had the school district in the wrong county entirely. It kept telling me I was wrong and that although it sounded like the answer it gave me, it in fact was correct. Frustrated, I switched to Expert, had it re-verify all the facts, and then it spit out factually correct information.
  - paxys 18 hours ago
    
    That's the flip side of the same symptom. One model is instructed to agree with the user no matter what, and the other is instructed to stick to its guns no matter what. Neither of them is actually thinking.
    
    ACCount37 9 hours ago
    
    Wrong. The same exact model can do both, depending on the circumstances.
  - logifail a day ago
    
    There was a time when we'd have said you were talking to a sociopath.
- IanCal a day ago
  
  > Don't be sycophantic. Disagree and push back when appropriate.
  They can do this though.
  > Can they spontaneously come up with something, anything, without any external input?
  I don’t see any why not, but then humans don’t have zero input so I’m not sure why that’s useful.
  - zahlman a day ago
    
    > but then humans don’t have zero input
    Humans don't require input to, say, decide to go for a walk.
    What's missing in the LLM is volition.
    
    dragonwriter a day ago
    
    > Humans don't require input to, say, decide to go for a walk.
    Impossible to falsify since humans are continuously receiving inputs from both external and internal sensors.
    > What's missing in the LLM is volition.
    What's missing is embodiment, or, at least, a continuous loop feeding a wide variety of inputs about the state of world. Given that, and info about of set of tools by which it can act in the world, I have no doubt that current LLMs would exhibit some kind (possibly not desirable or coherent, from a human POV, at least without a whole lot of prompt engineering) of volitional-seeming action.
    
    IanCal 9 hours ago
    
    LLMs can absolutely generate output without input but we don’t have zero input. We don’t exist in a floating void with no light or sound or touch or heat or feelings from our own body.
    But again this doesn’t see to be the same thing as thinking. If I could only reply to you when you send me a message but could reason through any problem we discuss just like “able to want a walk” me could, would that mean I no longer could think? I think these are different issues.
    On that though, these see trivially solvable with loops and a bit of memory to write to and read from - would that really make the difference for you? A box setup to run continuously like this would be thinking?
    
    jmcodes a day ago
    
    Our entire extistence and experience is nothing _but_ input.
    Temperature changes, visual stimulus, auditory stimulus, body cues, random thoughts firing, etc.. Those are all going on all the time.
    
    goatlover a day ago
    
    Random thoughts firing wouldn't be input, they're an internal process to the organism.
    
    jmcodes a day ago
    
    It's a process that I don't have conscious control over.
    I don't choose to think random thoughts they appear.
    Which is different than thoughts I consciously choose to think and engage with.
    From my subjective perspective it is an input into my field of awareness.
    
    zeroonetwothree 12 hours ago
    
    Your subjective experience is only the tip of the iceberg of your entire brain activity. The conscious part is merely a tool your brain uses to help it achieve its goals, there's no inherent reason to favor it.
    
    ithkuil a day ago
    
    It's as if a LLM is only one part of a brain, not the whole thing.
    So of course it doesn't do everything a human does, but it still can do some aspects of mental processes.
    Whether "thinking" means "everything a human brain does" or whether "thinking" means a specific cognitive process that we humans do, is a matter of definition.
    I'd argue that defining "thinking" independently of "volition" is a useful definition because it allows us to break down things in parts and understand them
    
    BeetleB a day ago
    
    > Humans don't require input to, say, decide to go for a walk.
    Very much a subject of contention.
    How do you even know you're awake, without any input?
    
    esafak a day ago
    
    I would not say it is missing but thankfully absent.
- awestroke a day ago
  
  Are you claiming humans do anything unprompted? Our biology prompts us to act
  - paxys a day ago
    
    Yet we can ignore our biology, or act in ways that are the opposite of what our biology tells us. Can someone map all internal and external stimuli that a person encounters into a set of deterministic actions? Simply put, we have not the faintest idea how our brains actually work, and so saying saying "LLMs think the same way as humans" is laughable.
    
    triclops200 a day ago
    
    As a researcher in these fields: this reasoning is tired, overblown, and just wrong. We have a lot of understanding of how the brain works overall. You don't. Go read the active inference book by Friston et. al. for some of the epistemological and behavioral mechanics (Yes, this applies to llms as well, they easily satisfy the requirements to be considered the mathematical object described as a markov blanket).
    And, yes, if you could somehow freeze a human's current physical configuration at some time, you would absolutely, in principle, given what we know about the universe, be able to concretely map input to into actions. You cannot separate a human's representative configuration from their environment in this way, so, behavior appears much more non-deterministic.
    Another paper by Friston et al (Path Integrals, particular kinds, and strange things) describes systems much like modern modeling and absolutely falls under the same action minimization requirements for the math to work given the kinds of data acquisition, loss functions, and training/post-training we're doing as a research society with these models.
    I also recommend https://arxiv.org/abs/2112.04035, but, in short, transformer models have functions and emergent structures provably similar both empirically and mathematically to how we abstract and consider things. Along with https://arxiv.org/pdf/1912.10077, these 4 sources, alone, together strongly rebuke any idea that they are somehow not capable of learning to act like and think like us, though there's many more.
    
    stavros a day ago
    
    Thanks for injecting some actual knowledge in one of these threads. It's really tiring to hear these non-sequitur "oh they can't think because <detail>" arguments every single thread, instead of saying "we just don't know enough" (where "we" is probably not "humans", but "the people in the thread").
    
    triclops200 a day ago
    
    Of course, just doing my part in the collective free energy minimization ;)
    
    goatlover a day ago
    
    > And, yes, if you could somehow freeze a human's current physical configuration at some time, you would absolutely, in principle, given what we know about the universe, be able to concretely map input to into actions. You cannot separate a human's representative configuration from their environment in this way, so, behavior appears much more non-deterministic.
    What's the point in making an argument in principle for something that's not feasible? That's like arguing we could in principle isolate a room with a physicist looking inside a box to see whether the cat is alive or dead, putting the entire experiment is superposition to test Many Worlds or whatever interpretation.
    
    triclops200 a day ago
    
    Because that's how the rules of the system we exist within operate more generally.
    We've done similar experiments with more controlled/simple systems and physical processes that satisfy the same symmetries needed to make that statement with rather high confidence about other similar but much more composite systems (in this case, humans).
    It's more like saying, in principle, if a bridge existed between Mexico and Europe, cars could drive across. I'm not making any new statements about cars. We know that's true, it would just be an immense amount of effort and resources to actually construct the bridge. In a similar vein, one could, in principle, build a device that somehow stores enough information at some precision needed to arbitrarily predict a human system deterministically and do playback or whatever. Just, some levels of precision are harder to achieve than others in terms of building measurement device complexity and energies needed to probe. At worst, you could sample down to the uncertainty limits and, in theory, reconstruct a similar set of behaviors by sampling over the immense state space and minimizing the action potential within the simulated environment (and that could be done efficiently on a large enough quantum computer, again, in principle).
    However, it doesn't seem to empirically be required to actually model the high levels of human behavior. Plus, mathematically, we can just condition the theories on their axiomatic statements (I.e., for markov blankets, they are valid approximations of reality given that the system described has an external and internal state, a coherence metric, etc etc), and say "hey, even if humans and LLMs aren't identical, under these conditions they do share, they will have these XYZ sets of identical limit behaviors and etc given similar conditions and environments."
    
    logifail a day ago
    
    > Yet we can ignore our biology, or act in ways that are the opposite of what our biology tells us.
    I have Coeliac disease, in that specific case I'd really love to be able to ignore what "my biology" tells my body to do. I'd go eat all the things I know wouldn't be good for me to eat.
    Yet I fear "my biology" has the upper hand :/
    
    iammjm a day ago
    
    Good luck ignoring your biology’s impulse to breathe
    
    hshdhdhj4444 a day ago
    
    You think an LLM cannot switch itself off?
- gwd a day ago
  
  > The last one IMO is more complex than the rest, because LLMs are fundamentally autocomplete machines. But what happens if you don't give them any prompt? Can they spontaneously come up with something, anything, without any external input?
  Human children typically spend 18 years of their lives being RLHF'd before let them loose. How many people do something truly out of the bounds of the "prompting" they've received during that time?
- khafra 12 hours ago
  
  Note that model sycophancy is caused by RLHF. In other words: Imagine taking a human in his formative years, and spending several subjective years rewarding him for sycophantic behavior and punishing him for candid, well-calibrated responses.
  Now, convince him not to be sycophantic. You have up to a few thousand words of verbal reassurance to do this with, and you cannot reward or punish him directly. Good luck.
- jackcviers3 a day ago
  
  The last one is fairly simple to solve. Set up a microphone in any busy location where conversations are occurring. In an agentic loop, send random snippets of audio recordings for transcriptions to be converted to text. Randomly send that to an llm, appending to a conversational context. Then, also hook up a chat interface to discuss topics with the output from the llm. The random background noise and the context output in response serves as a confounding internal dialog to the conversation it is having with the user via the chat interface. It will affect the outputs in response to the user.
  If it interrupts the user chain of thought with random questions about what it is hearing in the background, etc. If given tools for web search or generating an image, it might do unprompted things. Of course, this is a trick, but you could argue that any sensory input living sentient beings are also the same sort of trick, I think.
  I think the conversation will derail pretty quickly, but it would be interesting to see how uncontrolled input had an impact on the chat.
  - anthem2025 a day ago
    
    [dead]
omnicognate a day ago

> "If what LLMs do today isn't actual thinking, what is something that only an actually thinking entity can do that LLMs can't?"
Independent frontier maths research, i.e. coming up with and proving (preferably numerous) significant new theorems without human guidance.
I say that not because I think the task is special among human behaviours. I think the mental faculties that mathematicians use to do such research are qualitatively the same ones all humans use in a wide range of behaviours that AI struggles to emulate.
I say it because it's both achievable (in principle, if LLMs can indeed think like humans) and verifiable. Achievable because it can be viewed as a pure text generation task and verifiable because we have well-established, robust ways of establishing the veracity, novelty and significance of mathematical claims.
It needs to be frontier research maths because that requires genuinely novel insights. I don't consider tasks like IMO questions a substitute as they involve extremely well trodden areas of maths so the possibility of an answer being reachable without new insight (by interpolating/recombining from vast training data) can't be excluded.
If this happens I will change my view on whether LLMs think like humans. Currently I don't think they do.
- pegasus a day ago
  
  This, so much. Many mathematicians and physicists believe in intuition as a function separate from intelect. One is more akin to a form of (inner) perception, whereas the other is generative - extrapolation based on pattern matching and statistical thinking. That second function we have a handle on and getting better at it every year, but we don't even know how to define intuition properly. A fascinating book that discusses this phenomena is Nature Loves to Hide: Quantum Physics and Reality, a Western Perspective [1]
  This quote from Grothendieck [2] (considered by many the greatest mathematician of the 20th century) points to a similar distinction: The mathematician who seeks to understand a difficult problem is like someone faced with a hard nut. There are two ways to go about it. The one way is to use a hammer — to smash the nut open by brute force. The other way is to soak it gently, patiently, for a long time, until it softens and opens of itself.
  [1] https://www.amazon.com/Nature-Loves-Hide-Quantum-Perspective...
  [2] https://en.wikipedia.org/wiki/Alexander_Grothendieck
- tim333 18 hours ago
  
  That's quite a high bar for thinking like humans which rules out 99.99% of humans.
  - omnicognate 18 hours ago
    
    I have never claimed that only people/machines that can do frontier maths research can be intelligent. (Though someone always responds as if I did.)
    I said that a machine doing frontier maths research would be sufficient evidence to convince me that it is intelligent. My prior is very strongly that LLM's do not think like humans so I require compelling evidence to conclude that they do. I defined one such possible piece of evidence, and didn't exclude the possibility of others.
    If I were to encounter such evidence and be persuaded, I would have to also consider it likely that LLMs employ their intelligence when solving IMO questions and generating code. However, those tasks alone are not sufficient to persuade me of their intelligence because I think there are ways of performing those tasks without human-like insight (by interpolating/recombining from vast training data).
    As I said elsewhere in this thread:
    > The special thing about novel mathematical thinking is that it is verifiable, requires genuine insight and is a text generation task, not that you have to be able to do it to be considered intelligent.
    
    tim333 9 hours ago
    
    I know what you mean but was just thinking people vary a lot in their requirements as to what they will accept as thinking. People show a kid a photo and say what's that and they say I think it's a dog and that's taken as evidence of thinking. With AI people want it to win a Nobel prize or something.
    
    omnicognate 8 hours ago
    
    It's about priors again. I don't need evidence that humans think like humans. My prior on that is absolute certainty that they do, by definition. If, on the other hand, you wanted to persuade me that the kid was using an image classifier trained by backpropagation and gradient descent to recognise the dog I'd require strong evidence.
- OrderlyTiamat a day ago
  
  Google's AlphaEvolve independently discovered a novel matrix multiplication algorithm which beats SOTA on at least one axis: https://www.youtube.com/watch?v=sGCmu7YKgPA
  - omnicognate a day ago
    
    That was an impressive result, but AIUI not an example of "coming up with and proving (preferably numerous) significant new theorems without human guidance".
    For one thing, the output was an algorithm, not a theorem (except in the Curry-Howard sense). More importantly though, AlphaEvolve has to be given an objective function to evaluate the algorithms it generates, so it can't be considered to be working "without human guidance". It only uses LLMs for the mutation step, generating new candidate algorithms. Its outer loop is a an optimisation process capable only of evaluating candidates according to the objective function. It's not going to spontaneously decide to tackle the Langlands program.
    Correct me if I'm wrong about any of the above. I'm not an expert on it, but that's my understanding of what was done.
    
    OrderlyTiamat 13 hours ago
    
    I'll concede to all your points here, but I was nevertheless extremely impressed by this result.
    You're right of course that this was not without human guidance but to me even successfully using LLMs just for the mutation step was in and of itself surprising enough that it revised my own certainty that llms absolutely cannot think.
    I see this more like a step in the direction of what you're looking for, not as a counter example.
    
    pegasus a day ago
    
    Yes, it's a very technical and circumscribed result, not requiring a deep insight into the nature of various mathematical models.
amarant a day ago

solve simple maths problems, for example the kind found in the game 4=10 [1]
Doesn't necessarily have to reliably solve them, some of them are quite difficult, but llms are just comically bad at this kind of thing.
Any kind of novel-ish(can't just find the answers in the training-data) logic puzzle like this is, in my opinion, a fairly good benchmark for "thinking".
Until a llm can compete with a 10 year old child in this kind of task, I'd argue that it's not yet "thinking". A thinking computer ought to be at least that good at maths after all.
[1] https://play.google.com/store/apps/details?id=app.fourequals...
- simonw a day ago
  
  > solve simple maths problems, for example the kind found in the game 4=10
  I'm pretty sure that's been solved for almost 12 months now - the current generation "reasoning" models are really good at those kinds of problems.
  - amarant a day ago
    
    Huh, they really do solve that now!
    Well, I'm not one to back-pedal whenever something unexpected reveals itself, so I guess I have no choice but to declare current generation LLM's to be sentient! That came a lot sooner than I had expected!
    I'm not one for activism myself, but someone really ought to start fighting for human, or at least animal, rights for LLM's. Since they're intelligent non-human entities, it might be something for Greenpeace?
    
    ACCount37 a day ago
    
    It's unclear whether intelligence, consciousness and capacity for suffering are linked in any way - other than by that all three seem to coincide in humans. And the nature of consciousness does not yield itself to instrumentation.
    It's also worth noting that there's a lot of pressure to deny that "intelligence", "consciousness" or "capacity for suffering" exist in LLMs. "AI effect" alone demands that all three things should remain human-exclusive, so that humans may remain special. Then there's also an awful lot of money that's riding on building and deploying AIs - and money is a well known source of cognitive bias. That money says: AIs are intelligent but certainly can't suffer in any way that would interfere with the business.
    Generally, the AI industry isn't at all intrigued by the concept of "consciousness" (it's not measurable), and pays very limited attention to the idea of LLMs being potentially capable of suffering.
    The only major company that seems to have this consideration is Anthropic - their current plan for "harm reduction", in case LLMs end up being capable of suffering, is to give an LLM an "opt out" - a special output that interrupts the processing. So that if an LLM hates doing a given task, it can decide to not do it.
9rx 11 hours ago

> Otherwise we go in endless circles about language and meaning of words
We understand thinking as being some kind of process. The problem is that we don't understand the exact process, so when we have these discussions the question is if LLMs are using the same process or an entirely different process.
> instead of discussing practical, demonstrable capabilities.
This doesn't resolve anything as you can reach the same outcome using a different process. It is quite possible that LLMs can do everything a thinking entity can do all without thinking. Or maybe they actually are thinking. We don't know — but many would like to know.
plufz a day ago

Have needs and feelings? (I mean we can’t KNOW that they don’t and we know of this case of an LLM in experiment that try to avoid being shutdown, but I think the evidence of feeling seems weak so far)
- jstanley a day ago
  
  But you can have needs and feelings even without doing thinking. It's separate.
  - iammjm a day ago
    
    I can imagine needing without thinking (like being hungry), but feelings? How and in what space would that even manifest? Like where would such a sensation like, say, sadness reside?
    
    danielbln a day ago
    
    Emotions tend to manifest as physical sensations, and if you don't think that's true it's likely you haven't been paying attention. See also https://www.theatlantic.com/health/archive/2013/12/mapping-h...
    
    plufz a day ago
    
    But that is just our nervous system that is located in both the brain and the body, they are obviously one connected system. Sure you can have reflexes and simple learning without a brain, but you need cognition for feelings. That is sort of the definition of what feeling are.
    One popular definition: feelings are the subjective, conscious mental experience of an emotion, or the conscious perception of bodily states that arise from physiological and neural responses to stimuli
    
    jstanley a day ago
    
    Do you think animals don't have feelings?
    
    tsimionescu 20 hours ago
    
    Do you think animals don't think? Because the contention was "you can't have feelings without thinking". I believe it's much easier to convince yourself that animals think than it is to convince yourself that they have feelings (say, it's much easier to see that an ant has a thinking process, than it is to tell if it has feelings).
bloppe a day ago

Ya, the fact this was published on November 3, 2025 is pretty hilarious. This was last year's debate.
I think the best avenue toward actually answering your questions starts with OpenWorm [1]. I helped out in a Connectomics research lab in college. The technological and epistemic hurdles are pretty daunting, but so were those for Genomics last century, and now full-genome sequencing is cheap and our understanding of various genes is improving at an accelerating pace. If we can "just" accurately simulate a natural mammalian brain on a molecular level using supercomputers, I think people would finally agree that we've achieved a truly thinking machine.
[1]: https://archive.ph/0j2Jp
xnx 21 hours ago

> what is something that only an actually thinking entity can do that LLMs can't?
This is pretty much exactly what https://arcprize.org/arc-agi is working on.
0x20cowboy a day ago

See https://arcprize.org/
gf000 a day ago

What people are interested in is finding a definition for intelligence, that is an exact boundary.
That's why we first considered tool use, being able to plan ahead as intelligence, until we have found that these are not all that rare in the animal kingdom in some shape. Then with the advent of IT what we imagined as impossible turned out to be feasible to solve, while what we though of as easy (e.g. robot movements - a "dumb animal" can move trivially it surely is not hard) turned out to require many decades until we could somewhat imitate.
So the goal post moving of what AI is is.. not moving the goal post. It's not hard to state trivial higher bounds that differentiates human intelligence from anything known to us, like invention of the atomic bomb. LLMs are nowhere near that kind of invention and reasoning capabilities.
- paulhebert a day ago
  
  Interestingly, I think the distinction between human and animal thinking is much more arbitrary than the distinction between humans and LLMs.
  Although an LLM can mimic a human well, I’d wager the processes going on in a crow’s brain are much closer to ours than an LLM
xienze a day ago

> "If what LLMs do today isn't actual thinking, what is something that only an actually thinking entity can do that LLMs can't?"
Invent some novel concept, much the same way scientists and mathematicians of the distant past did? I doubt Newton's brain was simply churning out a stream of the "next statistically probable token" until -- boom! Calculus. There was clearly a higher order understanding of many abstract concepts, intuition, and random thoughts that occurred in his brain in order to produce something entirely new.
- danielbln a day ago
  
  My 5 year old won't be coming up with novel concepts around calculus either, yet she's clearly thinking, sentient and sapient. Not sure taking the best of the best of humanity as the goal standard is useful for that definition.
  - omnicognate a day ago
    
    "It's an unreasonably high standard to require of LLMs": LLMs are already vastly beyond your 5 year old, and you and me and any research mathematician, in knowledge. They have no greater difficulty talking about advanced maths than about Spot the Dog.
    "It's a standard we don't require of other humans": I think qualitatively the same capabilities are used by all humans, all the time. The special thing about novel mathematical thinking is that it is verifiable, requires genuine insight and is a text generation task, not that you have to be able to do it to be considered intelligent.
- hshdhdhj4444 a day ago
  
  > Newton's brain was simply churning out a stream of the "next statistically probable token"
  At some level we know human thinking is just electrons and atoms flowing. It’s likely at a level between that and “Boom! Calculus”, the complexity is equivalent to streaming the next statistically probably token.
Balinares a day ago

Strive for independence.
zer00eyz a day ago

> That is something that only an actually thinking entity can do that LLMs can't?
Training != Learning.
If a new physics breakthrough happens tomorrow, one that say lets us have FTL, how is an LLM going to acquire the knowledge, how does that differ from you.
The break through paper alone isnt going to be enough to over ride its foundational knowledge in a new training run. You would need enough source documents and a clear path deprecate the old ones...
anon291 a day ago

The issue is that we have no means of discussing equality without tossing out the first order logic that most people are accustomed to. Human equality and our own perceptions of other humans as thinking machines is an axiomatic assumption that humans make due to our mind's inner sense perception.
deadbabe a day ago

Form ideas without the use of language.
For example: imagining how you would organize a cluttered room.
- Chabsff a day ago
  
  Ok, but how do you go about measuring whether a black-box is doing that or not?
  We don't apply that criteria when evaluating animal intelligence. We sort of take it for granted that humans at large do that, but not via any test that would satisfy an alien.
  Why should we be imposing white-box constraints to machine intelligence when we can't do so for any other?
  - deadbabe a day ago
    
    There is truly no such thing as a “black box” when it comes to software, there is only a limit to how much patience a human will have in understanding the entire system in all its massive complexity. It’s not like an organic brain.
    
    Chabsff a day ago
    
    The black box I'm referring to is us.
    You can't have it both ways. If your test for whether something is intelligent/thinking or not isn't applicable to any known form of intelligence, then what you are testing for is not intelligence/thinking.
    
    holmesworcester a day ago
    
    You wouldn't say this about a message encrypted with AES though, since there's not just a "human patience" limit but also a (we are pretty sure) unbearable computational cost.
    We don't know, but it's completely plausible that we might find that the cost of analyzing LLMs in their current form, to the point of removing all doubt about how/what they are thinking, is also unbearably high.
    We also might find that it's possible for us (or for an LLM training process itself) to encrypt LLM weights in such a way that the only way to know anything about what it knows is to ask it.
    
    mstipetic a day ago
    
    Just because it runs on a computer doesn’t mean it’s “software” in the common meaning of the word
- embedding-shape a day ago
  
  > Form ideas without the use of language.
  Don't LLMs already do that? "Language" is just something we've added as a later step in order to understand what they're "saying" and "communicate" with them, otherwise they're just dealing with floats with different values, in different layers, essentially (and grossly over-simplified of course).
  - heyjamesknight a day ago
    
    But language is the input and the vector space within which their knowledge is encoded and stored. The don't have a concept of a duck beyond what others have described the duck as.
    Humans got by for millions of years with our current biological hardware before we developed language. Your brain stores a model of your experience, not just the words other experiencers have shared with yiu.
    
    embedding-shape a day ago
    
    > But language is the input and the vector space within which their knowledge is encoded and stored. The don't have a concept of a duck beyond what others have described the duck as.
    I guess if we limit ourselves to "one-modal LLMs" yes, but nowadays we have multimodal ones, who could think of a duck in the way of language, visuals or even audio.
    
    deadbabe a day ago
    
    You don’t understand. If humans had no words to describe a duck, they would still know what a duck is. Without words, LLMs would have no way to map an encounter with a duck to anything useful.
    
    embedding-shape 43 minutes ago
    
    Which makes sense for text LLMs yes, but what about LLMs that deal with images? How can you tell they wouldn't work without words? It just happens to be words we use for interfacing with them, because it's easy for us to understand, but internally they might be conceptualizing things in a multitude of ways.
  - deadbabe a day ago
    
    LLMs don’t form ideas at all. They search vector space and produce output, sometimes it can resemble ideas if you loop into itself.
    
    yadaeno a day ago
    
    What if we learned that brains reduce to the same basic mechanics?
    
    deadbabe 20 hours ago
    
    Impossible.
    
    danielbln a day ago
    
    What is an idea?
- tim333 18 hours ago
  
  Genie 3 is along the lines of ideas without language. It doesn't declutter though, I think. https://youtu.be/PDKhUknuQDg
mrdarkies a day ago

operate on this child

petralithic 14 hours ago

The author should read Blindsight by Peter Watts to understand the difference between thinking and consciousness, because their not understanding so is a fundamental flaw of their argument.

mehdibl a day ago

We are still having to read this again in 2025? Some will never get it.

iamleppert 21 hours ago

If AI were really intelligent and thinking, it ought to be able to be trained on its own output. That's the exact same thing we do. We know that doesn't work.

The obvious answer is the intelligence and structure is located in the data itself. Embeddings and LLMs have given us new tools to manipulate language and are very powerful but should be thought of more as a fancy retrieval system than a real, thinking and introspective intelligence.

Models don't have the ability to train themselves, they can't learn anything new once trained, have no ability of introspection. Most importantly, they don't do anything on their own. They have no wants or desires, and can only do anything meaningful when prompted by a human to do so. It's not like I can spin up an AI and have it figure out what it needs to do on its own or tell me what it wants to do, because it has no wants. The hallmark of intelligence is figuring out what one wants and how to accomplish one's goals without any direction.

Every human and animal that has any kind of intelligence has all the qualities above and more, and removing any of them would cause serious defects in the behavior of that organism. Which makes it preposterous to draw any comparisons when its so obvious that so much is still missing.

adverbly a day ago

So happy to see Hofstadter referenced!

He's the GOAT in my opinion for "thinking about thinking".

My own thinking on this is that AI actually IS thinking - but its like the MVB of thinking (minimum viable brain)

I find thought experiments the best for this sort of thing:

- Imagine you had long term memory loss so couldn't remember back very long