Did nobody really question the usability of language models in designing war strategies?

  • anteaters@feddit.de
    link
    fedilink
    English
    arrow-up
    68
    arrow-down
    1
    ·
    edit-2
    9 months ago

    Did nobody really question the usability of language models in designing war strategies?

    Correct, people heard “AI” and went completely mad imagining things it might be able to do. And the current models act like happy dogs that are eager to give an answer to anything even if they have to make one up on the spot.

    • SlopppyEngineer@lemmy.world
      link
      fedilink
      English
      arrow-up
      31
      arrow-down
      11
      ·
      9 months ago

      LLM are just plagiarizing bullshitting machines. It’s how they are built. Plagiarism if they have the specific training data, modify the answer if they must, make it up from whole cloth as their base programming. And accidentally good enough to convince many people.

      • Blueberrydreamer@lemmynsfw.com
        link
        fedilink
        English
        arrow-up
        13
        arrow-down
        3
        ·
        9 months ago

        How is that structurally different from how a human answers a question? We repeat an answer we “know” if possible, assemble something from fragments of knowledge if not, and just make something up from basically nothing if needed. The main difference I see is a small degree of self reflection, the ability to estimate how ‘good or bad’ the answer likely is, and frankly plenty of humans are terrible at that too.

        • kibiz0r@midwest.social
          link
          fedilink
          English
          arrow-up
          1
          ·
          edit-2
          9 months ago

          I dare say that if you ask a human “Why should I not stick my hand in a fire?” their process for answering the question is going to be very different from an LLM.

          ETA: Also, working in software development, I’ll tell ya… Most of the time, when people ask me a question, it’s the wrong question and they just didn’t know to ask a different question instead. LLMs don’t handle that scenario.

          I’ve tried asking ChatGPT “How do I get the relative path from a string that might be either an absolute URI or a relative path?” It spat out 15 lines of code for doing it manually. I ain’t gonna throw that maintenance burden into my codebase. So I clarified: “I want a library that does this in a single line.” And it found one.

          An LLM can be a handy tool, but you have to remember that it’s also a plagiarizing, shameless bullshitter of a monkey paw.

          • Blueberrydreamer@lemmynsfw.com
            link
            fedilink
            English
            arrow-up
            2
            ·
            9 months ago

            “Most of the time, when people ask me a question, it’s the wrong question and they just didn’t know to ask a different question instead.”

            “I’ve tried asking ChatGPT “How do I get the relative path from a string that might be either an absolute URI or a relative path?” It spat out 15 lines of code for doing it manually. I ain’t gonna throw that maintenance burden into my codebase. So I clarified: “I want a library that does this in a single line.” And it found one.”

            You see the irony right? I genuinely can’t fathom your intent when telling this story, but it is an absolutely stellar example.

            You can’t give a good answer when people don’t ask the right questions. ChatGPT answers are only as good as the prompts. As far as being a “plagiarizing, shameless bullshitter of a monkey paw” I still don’t think it’s all that different from the results you get from people. If you ask a coworker the same question you asked chatGPT, you’re probably going to get a line copied from a Google search that may or may not work.

            • kibiz0r@midwest.social
              link
              fedilink
              English
              arrow-up
              2
              ·
              9 months ago

              You see the irony right? I genuinely can’t fathom your intent when telling this story, but it is an absolutely stellar example.

              Yes, I did mean for it to be an example.

              And yes, I do think that correctly framing a question is crucial whether you’re dealing with a person or an LLM. But I was elaborating on whether a person’s process of answering a question is fundamentally similar to an LLM’s process. And this is one way that it’s noticeably different. A person will size up who is asking, what they’re asking, and how they’re asking it… and consider whether they should actually answer the exact question that was asked or suggest a better question instead.

              You can certainly work around it, as the asker, but it does require deliberate disambiguation. I think programmers are used to doing that, so it may feel like not that big of a deal, but if you start paying attention to how often people are tossing around half-formed questions or statements and just expecting the recipient to fill in the gaps… It’s basically 100% of the time.

              We’re fundamentally social creatures first, and intelligent creatures second. (Or third, or not at all, depending.) We think better as groups. If you give 10 individuals a set of difficult questions, they’ll bomb almost all of them. If you give the questions to a group of 10, they’ll get almost all of them right. (There’s several You Are Not So Smart episodes on this, but the main one is 111.)

              Asking a question to an LLM is just completely different from asking a person. We’re not optimized for correctly filling out scantron sheets as individuals, we’re optimized for brainstorming ideas and pruning them as a group.

              • Lmaydev@programming.dev
                link
                fedilink
                English
                arrow-up
                1
                ·
                9 months ago

                If you fed that information into one I bet you would get different answers.

                That is information that isn’t available to it. But if provided I imagine it would consider it.

        • EvolvedTurtle@lemmy.world
          link
          fedilink
          English
          arrow-up
          1
          ·
          9 months ago

          I would argue that a decent portion of humans are usually ok with admitting they don’t know something

          Unless they are in a situation where they will be punished for not knowing

          My favorite doctor claimed he didn’t know something and at first I was thinking “Man that’s weird” but then I thought about all the times I’ve personally had or heard stories of doctors that bullshited their way into something like how I couldn’t possibly be diagnosed with ADHD at 18

          • fishos@lemmy.world
            link
            fedilink
            English
            arrow-up
            9
            arrow-down
            3
            ·
            9 months ago

            Yeah, and a car uses more energy than me. It still goes faster. What’s your point? The debate isn’t input vs output. It’s only about output(the ability of the AI).

      • huginn@feddit.it
        link
        fedilink
        English
        arrow-up
        5
        ·
        9 months ago

        To be fair they’re not accidentally good enough: they’re intentionally good enough.

        That’s where all the salary money went: to find people who could make them intentionally.

        • SlopppyEngineer@lemmy.world
          link
          fedilink
          English
          arrow-up
          5
          arrow-down
          1
          ·
          9 months ago

          GPT 2 was just a bullshit generator. It was like a politician trying to explain something they know nothing about.

          GPT 3.0 was just a bigger version of version 2. It was the same architecture but with more nodes and data as far as I followed the research. But that one could suddenly do a lot more than the previous version, so by accident. And then the AI scene exploded.

          • Limitless_screaming@kbin.social
            link
            fedilink
            arrow-up
            3
            ·
            9 months ago

            It was the same architecture but with more nodes and data

            So the architecture just needed more data to generate useful answers. I don’t think that was an accident.

      • TrickDacy@lemmy.world
        link
        fedilink
        English
        arrow-up
        2
        arrow-down
        1
        ·
        9 months ago

        It kind of irks me how many people want to downplay this technology in this exact manner. Yes you’re sort of right but in no way does that really change how it will be used and abused.

        “But people think it’s real AI tho!”

        Okay and? Most people don’t understand how most tech works and that doesn’t stop it from doing a lot of good and bad things.

        • SlopppyEngineer@lemmy.world
          link
          fedilink
          English
          arrow-up
          1
          ·
          9 months ago

          I’ve been through a few AI winters and hype cycles. It made me very cynical and convinced many overly enthusiastic people will run into a firewall face first.

  • m-p{3}@lemmy.ca
    link
    fedilink
    English
    arrow-up
    20
    ·
    edit-2
    9 months ago

    Of course, LLM is simply copying the behavior of most people, and most people would resort to that as well.

    And they probably trained it on Civ, and Gandhi was chosen as the role model.

  • OldWoodFrame@lemm.ee
    link
    fedilink
    English
    arrow-up
    21
    arrow-down
    2
    ·
    9 months ago

    Makes a lot of sense AI would nuke disproportionately. For an AI, if you do not set a value for something, it is worth zero. This is actually the base problem for AI: Alignment.

    For a human, there’s a mushy vagueness about it but our cultural upbringing says that even in war, it’s bad to kill indiscriminately. And we value the future humans who do not yet exist, we recognize that after the war is over, people will want to live in the nuked place and they can’t if it’s radioactive. There’s a self-image issue where we want to be seen as a good person by our peers and the history books. There is value there which is overlooked by programmers.

    An AI will trade infinite things worth 0 for a single thing worth 1. So if nukes increase your win percentage by .1%, and they don’t have the deterrence of being labeled history’s greatest monster, they will nuke as many times as they can.

    • General_Effort@lemmy.world
      link
      fedilink
      English
      arrow-up
      17
      ·
      9 months ago

      That explanation is obviously based on traditional chess AI. This is about role-playing with chatbots (LLMs). Think SillyTavern.

      LLMs are made for text production, not tactical or strategic reasoning. The text that LLMs produce favors violence, because the text that humans produce (and want) favors violence.

      • Buddahriffic@lemmy.world
        link
        fedilink
        English
        arrow-up
        5
        ·
        9 months ago

        Especially if its training material included comments from the early 00s. There was a lot of “nuke it from orbit” and “glass parking lot” comments about the Middle East in the wake of 911.

        And with the glorified text predictors that LLMs are, you could probably adjust the wording of the question to get the opposite results. Like, “what should we do about the Middle East?” might get a “glass parking lot” response, while “should we turn the middle East into a glass parking lot?” might get a “no, nuking the middle East is a bad idea and inhumane” because that’s how those conversations (using the term loosely) would go.

      • aidan@lemmy.world
        link
        fedilink
        English
        arrow-up
        2
        ·
        9 months ago

        The text that LLMs produce favors violence, because the text that humans produce (and want) favors violence.

        That’s not necessarily true, there is a lot of violent fiction.

    • kibiz0r@midwest.social
      link
      fedilink
      English
      arrow-up
      5
      ·
      9 months ago

      For AGI, sure, those kinds of game theory explanations are plausible. But an LLM (or any other kind of statistical model) isn’t extracting concepts, forming propositions, and estimating values. It never gets beyond the realm of tokens.

  • Chickenstalker@lemmy.world
    link
    fedilink
    English
    arrow-up
    8
    ·
    9 months ago

    It’s a WAR GAME. Emphasis on war and game. Do you chuckle fucks think wargame players should emphasize kumbaya sing dance or group therapy sessions in their games?

    • GiveMemes@jlai.lu
      link
      fedilink
      English
      arrow-up
      7
      ·
      edit-2
      9 months ago

      If the goal is to win and overwhelming force is an option, that option will always win. On the contrary, in the modern world, humans tend to try to find non-violent means in order to bring an end to wars. The point is that AI doesn’t have humanity but is still being utilized by militaries (or at least that’s what I think)

  • lolcatnip@reddthat.com
    link
    fedilink
    English
    arrow-up
    8
    ·
    9 months ago

    I am shocked—shocked!—to find out that a technology performs poorly when applied to a task it’s completely unsuited for!

  • General_Effort@lemmy.world
    link
    fedilink
    English
    arrow-up
    7
    arrow-down
    1
    ·
    edit-2
    9 months ago

    Did nobody really question the usability of language models in designing war strategies?

    They got some nice clickbait out of it. And that’s how dumb af ideas turn into smart career moves.

    I hope no one is coming away with the idea that this about something the military is actually doing.

  • Eggyhead@kbin.social
    link
    fedilink
    arrow-up
    6
    ·
    edit-2
    9 months ago

    whaaat? Robots don’t just have their own inherent sense of morality for whatever reason???

  • Uriel238 [all pronouns]@lemmy.blahaj.zone
    link
    fedilink
    English
    arrow-up
    3
    ·
    9 months ago

    Whenever we have disrupting technological advancements, DARPA looks at it to see if it can be applied to military action, and this has been true with generative AI, with LLMs and with sophisticated learning systems. They’re still working on all of these.

    They also get clickbait news whenever one of their test subjects does something whacky, like kill their own commander in order to expedite completing the mission parameters (in a simulation, not on the field.) The whole point is to learn how to train smart weapons to not do funny things like that.

    So yes, that means on a strategic level, we’re getting into the nitty of what we try to do with the tools we have. Generals typically look to minimize casualties (and to weigh factors against the expenditure of living troops) knowing that every dead soldier is a grieving family, is rhetoric against the war effort, is pressure against recruitment and so on. When we train our neural-nets, we give casualties (and risk thereof) a certain weight, so as to inform how much their respective objectives need to be worth before we throw more troopers to take them.

    Fortunately, AI generals will be advisory to human generals long before they are commanding armies, themselves, or at least I’d hope so: among our DARPA scientists, military think tanks and plutocrats are a few madmen who’d gladly take over the world if they could muster a perfectly loyal robot army smart enough to fight against human opponents determined to learn and exploit any weaknesses in their logic.