• ShaunaTheDead@fedia.io
    arrow-up
    249
    ·
    5 months ago

    Reminds me of an early application of AI where scientists were training an AI to tell the difference between a wolf and a dog. It got really good at it in the training data, but it wasn’t working correctly in actual application. So they got the AI to give them a heatmap of which pixels it was using more than any other to determine if a canine is a dog or a wolf and they discovered that the AI wasn’t even looking at the animal, it was looking at the surrounding environment. If there was snow on the ground, it said “wolf”, otherwise it said “dog”.
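
    A toy sketch of how that kind of shortcut can happen, with entirely made-up data: two hand-invented features (`snow`, `big_animal`) and a one-feature "model" that just picks whichever feature best matches the training labels. None of this comes from the actual study; it only illustrates the failure mode.

```python
# Toy illustration of shortcut learning. All samples are invented.
# Each sample is (features, label); features are 0/1 flags.

def accuracy(feature, samples):
    """Accuracy of the rule 'feature == 1 means wolf, otherwise dog'."""
    hits = sum(1 for x, label in samples if (x[feature] == 1) == (label == "wolf"))
    return hits / len(samples)

def best_single_feature(samples):
    """Pick the single feature that best fits the training labels."""
    features = list(samples[0][0])
    return max(features, key=lambda f: accuracy(f, samples))

# Training photos: every wolf happens to be in snow, every dog on grass.
train = [
    ({"snow": 1, "big_animal": 1}, "wolf"),
    ({"snow": 1, "big_animal": 1}, "wolf"),
    ({"snow": 1, "big_animal": 0}, "wolf"),  # small wolf, still in snow
    ({"snow": 0, "big_animal": 0}, "dog"),
    ({"snow": 0, "big_animal": 0}, "dog"),
    ({"snow": 0, "big_animal": 1}, "dog"),   # big dog on grass
]

# Real-world photos: the background no longer lines up with the label.
field = [
    ({"snow": 0, "big_animal": 1}, "wolf"),  # wolf in summer
    ({"snow": 1, "big_animal": 0}, "dog"),   # dog playing in snow
    ({"snow": 1, "big_animal": 1}, "wolf"),
    ({"snow": 0, "big_animal": 0}, "dog"),
]

chosen = best_single_feature(train)
train_acc = accuracy(chosen, train)
field_acc = accuracy(chosen, field)
print(chosen, train_acc, field_acc)
```

    The "model" latches onto `snow` because it fits the training set perfectly, then drops to coin-flip accuracy once the background stops tracking the label.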

    • driving_crooner@lemmy.eco.br
      arrow-up
      145
      arrow-down
      1
      ·
      edit-2
      5 months ago

      Early chess engines that used AI were trained on GM games, and the engine would go out of its way to sacrifice the queen, because when GMs do it, it comes with a victory.

      • papalonian@lemmy.world
        arrow-up
        37
        ·
        5 months ago

        Reg, why’d you just stab yourself in the shoulder?

        Ah cmon, ain’t ya ever seen a movie?

        Well of course I’ve seen a movie, but what the hell are ya doing?

        Every time the guy stabs himself in a movie, it’s right before he kicks the piss outta the guy he’s fightin’!

        Well that don’t… when that happens, the guys gotta plan Reg, what the hell’s your plan?

        I dunno, but I’m gonna find out!

        • Kazumara@discuss.tchncs.de
          arrow-up
          7
          arrow-down
          1
          ·
          5 months ago

          You don’t use it for the rule-set and allowable moves, but to score board positions.

          For a chess computer, calculating all possible moves until the end of the game is not possible in the given time, because the number of potential positions grows exponentially with each further move. So you need to look at a few, and try to reject bad ones early, so that you only calculate further along promising paths.

          So you need to be able to say what is a better board position and what is a worse one. It’s complex to determine - in general - whether a position is better than another. Of course it is, otherwise everyone would just play the “good” positions, and chess would be boring like solved games e.g. Tic-Tac-Toe.

          Now to have your chess computer estimate board positions you can construct tons of rules and heuristics with expert knowledge to hopefully assign sensible values to positions. People do this. But you can also hope that there are some machine-learnable patterns in the data that you can discover by feeding historical games and the information on who won into an ML model. People do this too. I think both are fair approaches in this instance.
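
          A minimal sketch of that split between "search a few moves" and "score the position", using a toy take-away game instead of chess (take 1 to 3 stones, whoever takes the last stone wins). The game, the function names, and the trivial `evaluate` are all assumptions for illustration; in a chess program `evaluate` is where the hand-written heuristics or the trained model would go.

```python
import math

def negamax(state, depth, alpha, beta, moves, apply_move, evaluate):
    """Depth-limited negamax search with alpha-beta pruning.

    evaluate(state) scores a position from the point of view of the player
    to move. It is only called at the depth limit or when no moves remain.
    Returns (score, best_move).
    """
    legal = moves(state)
    if depth == 0 or not legal:
        return evaluate(state), None
    best_score, best_move = -math.inf, None
    for move in legal:
        child_score, _ = negamax(apply_move(state, move), depth - 1,
                                 -beta, -alpha, moves, apply_move, evaluate)
        score = -child_score  # what is good for the opponent is bad for us
        if score > best_score:
            best_score, best_move = score, move
        alpha = max(alpha, score)
        if alpha >= beta:
            break  # this line is already refuted, so stop exploring it
    return best_score, best_move

# Toy game: the state is the number of stones left.
def legal_moves(stones):
    return [k for k in (1, 2, 3) if k <= stones]

def take(stones, k):
    return stones - k

def evaluate(stones):
    # No stones left: the opponent took the last one, so the player to move lost.
    # Anything else at the depth limit is scored 0, meaning "don't know".
    return -1 if stones == 0 else 0

score, move = negamax(5, 4, -math.inf, math.inf, legal_moves, take, evaluate)
print(score, move)
```

          With 5 stones and a 4-ply limit the search settles on taking 1 stone, which leaves the opponent in a lost position. The deeper the tree, the more the quality of `evaluate` matters, which is the point above.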

          • KeenFlame@feddit.nu
            arrow-up
            1
            arrow-down
            1
            ·
            5 months ago

            You can calculate all possible moves in milliseconds on any silicon these days

            • Kazumara@discuss.tchncs.de
              arrow-up
              3
              arrow-down
              1
              ·
              edit-2
              5 months ago

              All possible moves one step from a given position sure.

              But if you then take all possible resulting positions and calculate all moves from there, and then take all possible resulting positions after that second move and calculate all possible third moves from there, and so on, then the possibilities explode so much in number that you can’t calculate them anymore. That’s the exponential part I was referring to.

              You can try and estimate them roughly. Let’s say you’re somewhere in the middle of the game and there are 12 units on each side still alive. About half are pawns, so we take 1.2 possible moves for them. For the others, well, let’s say around 8; that’s a bit much for horses and the king on average, but probably a bit low for other units. So 6 times 8 plus 6 times 1.2, let’s call it 55 possibilities. So for the first move there are 55 possible positions. For the second you have to consider all of them and their new possibilities, so there are 55 times 55 or 3,025. For the third that’s 166,375, then 9.15 million, 500 million, 27.6 billion, 1.5 trillion etc. That last one was only 7 moves in the future. Most games won’t be finished by then from a given position, so you either need a scoring function or you’re running out of time.
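
              The same back-of-the-envelope numbers, as a quick script. The per-piece move counts are just the rough guesses from above, not measured values.

```python
# Rough branching estimate: 6 pawns at ~1.2 moves each,
# 6 other pieces at ~8 moves each.
branching = round(6 * 1.2 + 6 * 8)  # 55.2, rounded down to 55

# Number of positions to consider when looking 1..7 moves ahead.
positions = [branching ** depth for depth in range(1, 8)]

for depth, count in enumerate(positions, start=1):
    print(f"{depth} moves ahead: {count:,} positions")
```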

              • KeenFlame@feddit.nu
                arrow-up
                1
                arrow-down
                1
                ·
                5 months ago

                Yep, those are the moves that can all be easily calculated very quickly on modern hardware

                • Dave@lemmy.nz
                  arrow-up
                  1
                  ·
                  5 months ago

                  There are more possible chess games (estimated at 10^120 for a typical-length game) than there are atoms in the observable universe (estimated at 10^80). That is to say, the number of possible chess games has 40 more zeros on the end than the number of atoms in the observable universe.

                  Can you point to some source showing how modern hardware can work these out easily?
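
                  For a sense of scale, here is the arithmetic with a made-up throughput of one billion positions per second. That rate is an assumption picked for illustration, not a benchmark of any real hardware.

```python
GAME_TREE = 10 ** 120   # estimate quoted above
ATOMS = 10 ** 80        # estimate quoted above
RATE = 10 ** 9          # hypothetical: positions examined per second
SECONDS_PER_YEAR = 365 * 24 * 3600

ratio = GAME_TREE // ATOMS
years = GAME_TREE // RATE // SECONDS_PER_YEAR

# len(str(n)) - 1 gives the power of ten for a positive integer n.
ratio_exponent = len(str(ratio)) - 1
years_exponent = len(str(years)) - 1

print(f"games per atom: 10^{ratio_exponent}")
print(f"years to enumerate at the assumed rate: on the order of 10^{years_exponent}")
```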

    • kandoh@reddthat.com
      arrow-up
      44
      ·
      5 months ago

      That’s funny because if I was trying to tell the difference between a wolf and a dog I would look for ‘is it in the woods?’ and ‘how big is it relative to what’s around it?’.

    • OsrsNeedsF2P@lemmy.ml
      arrow-up
      15
      arrow-down
      8
      ·
      5 months ago

      While I believe that, it’s an issue with the training data, and not the hardest to resolve

      • dondelelcaro@lemmy.world
        arrow-up
        56
        ·
        5 months ago

        Maybe not the hardest, but still challenging. Unknown biases in training data are a challenge in any experimental design. Opaque ML frequently makes them more challenging to discover.

        • nova_ad_vitum@lemmy.ca
          arrow-up
          25
          ·
          edit-2
          5 months ago

          The unknown biases issue has no real solution. In this same example if instead of something simple like snow in the background, it turned out that the photographs of wolves were taken using zoom lenses (since photogs don’t want to get near wild animals) while the dog photos were closeup and the ML was really just training to recognize subtle photographic artifacts caused by the zoom lenses, this would be extremely difficult to detect let alone prove.

          • dondelelcaro@lemmy.world
            arrow-up
            4
            ·
            5 months ago

            Exactly.

            The general approach is to use interpretable models where you can understand how the model works and what features it uses to discriminate, but that doesn’t work for all ML approaches (and even when it does, our understanding is incomplete).

      • Mirodir@discuss.tchncs.de
        arrow-up
        14
        arrow-down
        1
        ·
        5 months ago

        So is the example with the dogs/wolves and the example in the OP.

        As to how hard to resolve, the dog/wolves one might be quite difficult, but for the example in the OP, it wouldn’t be hard to feed in all images (during training) with randomly chosen backgrounds to remove the model’s ability to draw any conclusions based on background.
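
        A rough sketch of that augmentation on a tiny grayscale "image", assuming you already have a mask marking which pixels belong to the person (getting that mask is its own problem and is not shown). Random noise stands in for "randomly chosen backgrounds" here, and `randomize_background` is a made-up helper, not a function from any library.

```python
import random

def randomize_background(image, mask, rng):
    """Return a copy of image with every non-subject pixel replaced by noise.

    image: 2D list of grayscale values in 0..255
    mask:  2D list of booleans, True where the pixel belongs to the subject
    rng:   a random.Random instance, passed in so runs can be reproduced
    """
    return [
        [pixel if keep else rng.randrange(256) for pixel, keep in zip(row, mask_row)]
        for row, mask_row in zip(image, mask)
    ]

# 4x4 toy image: a bright 2x2 "subject" on a uniform dark background.
image = [
    [10, 10, 10, 10],
    [10, 200, 210, 10],
    [10, 205, 215, 10],
    [10, 10, 10, 10],
]
mask = [[1 <= r <= 2 and 1 <= c <= 2 for c in range(4)] for r in range(4)]

# Call this with a fresh background for each training sample so the
# background carries no consistent signal.
augmented = randomize_background(image, mask, random.Random(0))
```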

        However this would probably unearth the next issue. The one where the human graders, who were probably used to create the original training dataset, have their own biases based on race, gender, appearance, etc. This doesn’t even necessarily mean that they were racist/sexist/etc, just that they struggle to detect certain emotions in certain groups of people. The model would then replicate those issues.

      • merc@sh.itjust.works
        arrow-up
        8
        ·
        5 months ago

        Yes, “Bias Automation” is always an issue with the training data, and it’s always harder to resolve than anyone thinks.

  • Th4tGuyII@fedia.io
    arrow-up
    159
    arrow-down
    2
    ·
    5 months ago

    The idea of AI automated job interviews sickens me. How little of a fuck do you have to give about applicants that you can’t even be bothered to have even a single person interview them??

    • Fisch@discuss.tchncs.de
      English
      arrow-up
      88
      arrow-down
      1
      ·
      5 months ago

      But god forbid the applicant didn’t spend hours researching every little detail about a company, writing a perfect letter with information that could have just been bullet points and being able to explain exactly why they absolutely love the company and why it’s been their dream to work there since they were a child. Or even worse: Use AI to write the application.

      • Melvin_Ferd@lemmy.world
        arrow-up
        34
        ·
        edit-2
        5 months ago

        Cover letters fucking make me hateful. I love generating AI cover letters and sending them. Fuck your cover letters in a market where you need to send 100 applications to get 10 bites

      • Th4tGuyII@fedia.io
        arrow-up
        9
        ·
        5 months ago

        Exactly!

        Applicants are expected to dedicate hours of their time to writing their application and performing background research - both of which are becoming increasingly tedious over time - so the least a company could bloody do is show some basic respect by paying an actual human being to come interview you!

    • Eager Eagle@lemmy.world
      English
      arrow-up
      15
      ·
      edit-2
      5 months ago

      That’s more like an excuse to keep those stupid interview processes with 5, 6, or even more rounds. Basically making you work an entire week for free in exchange for a chance of getting an offer. Make the first or second rounds with AI and only bother after that.

    • OsrsNeedsF2P@lemmy.ml
      arrow-up
      10
      ·
      5 months ago

      I dunno, but if your boss chain contains a machine (literally Amazon warehouse), does it matter?

    • Riskable@programming.dev
      English
      arrow-up
      85
      ·
      5 months ago

      The base assumption is that you can tell anything reliable at all about a person from their body language, speech patterns, or appearance. So many people think they have an intuition for such things but pretty much every study of such things comes to the same conclusion: You can’t.

      The reason why it doesn’t work is because the world is full of a diverse set of cultures, genetics, and subtle medical conditions. You may be able to attain something like 60% accuracy for certain personality traits from an interview if the person being interviewed was born and raised in the same type of environment/culture (and is the same sex) as you. Anything else is pretty much a guarantee that you’re going to get it wrong.

      That’s why you should only ask interviewees empirical questions that can identify whether or not they have the requisite knowledge to do the job. For example, if you’re hiring an electrical engineer ask them how they would lay out a circuit board. Or if hiring a sales person ask them questions about how they would try to sell your specific product. Or if you’re hiring a union-busting expert person ask them how they sleep at night.

      • snooggums
        English
        arrow-up
        14
        arrow-down
        1
        ·
        5 months ago

        But all the other questions are to find out if they are a good fit for the office culture.

        You know, if they are also white middle class dude bros.

      • Bertuccio@lemmy.world
        arrow-up
        8
        ·
        5 months ago

        I’ve just started doing practical interviews. I basically get really young people with little overall experience and I just want to know if they can do common technical tasks.

        So one question is to literally have them explain how to tighten a bolt. One person failed.

        • Riskable@programming.dev
          English
          arrow-up
          10
          ·
          edit-2
          5 months ago

          To be fair, that’s a very open ended question. I mean, what kind of bolt are we talking about? A standard lag bolt? If so you don’t tighten it! That’d be a trick question! You tighten the nut. Same thing applies with car wheel bolts. Tricky tricky!

          Is it a hex bolt that also has a cross head? How tight are we talking?

          I’m just going to assume bolts of lightning and Usain Bolt are off the table.

          • Aceticon@lemmy.world
            arrow-up
            6
            ·
            edit-2
            5 months ago

            Not really in a bolt tightening domain, but I have done technical interviews for a lot of devs including junior ones, and them asking all those questions about the task is something I would consider a very good thing.

            At least in my domain the first step of doing a good job is figuring out exactly what needs to be done and in what conditions, so somebody who claims to have some experience who when faced with a somewhat open ended question like this just jumps into the How without first trying to figure out the details of the What is actually a bad sign (or they might just be nervous, so this by itself is not an absolute pass or fail thing).

          • Schadrach@lemmy.sdf.org
            English
            arrow-up
            5
            ·
            5 months ago

            I’m just going to assume bolts of lightning and Usain Bolt are off the table.

            The only thing I know about the procedure for tightening Usain Bolt is that I am not part of performing it.

          • Bertuccio@lemmy.world
            arrow-up
            4
            ·
            5 months ago

            I did actually make the mistake of asking just “which way do you turn a screw” once and the person had the sense to ask “to tighten or loosen it?”

              • Bertuccio@lemmy.world
                arrow-up
                3
                ·
                5 months ago

                Yeah but if they don’t show which is which I ask them to show too.

                Almost everyone gets screw turning right, it just weeds out a few people who say the right things in emails.

      • Xephonian@retrolemmy.com
        arrow-up
        7
        arrow-down
        14
        ·
        5 months ago

        That’s why you should only ask interviewees empirical questions that can identify whether or not they have the requisite knowledge to do the job.

        Hol up. ThAt sOuNds LiKe RaCisM!

  • enkers@sh.itjust.works
    arrow-up
    88
    arrow-down
    2
    ·
    5 months ago

    That shit works IRL too. Why do you think therapy practices often have themselves positioned in front of a wall of books? Not that it’s a bad thing; it’s good for outcomes to believe your therapist is competent and well educated.

  • cley_faye@lemmy.world
    arrow-up
    83
    arrow-down
    2
    ·
    5 months ago

    There’s a ton of great small-scale things we can do with machine learning, and even LLMs.

    Unfortunately, it seems the main usages will be crushing people down even more.

  • TAG@lemmy.world
    arrow-up
    77
    ·
    5 months ago

    That reminds me of the time, quite a few years ago, Amazon tried to automate resume screening. They trained a machine learning model with anonymized resumes and whether the candidate was hired. Then they looked at what the AI was looking at. The model had trained itself on how to reject women.

    • merc@sh.itjust.works
      arrow-up
      8
      ·
      5 months ago

      Another similar “shortcut” I’ve heard about was that a system that analyzed job performance determined that the two key factors were being named “Jared” and playing lacrosse in high school.

      And, these are the easy-to-figure-out ones we know about.

      If the bias is more complicated, it might never be spotted.

  • fubarx@lemmy.ml
    arrow-up
    70
    arrow-down
    1
    ·
    5 months ago

    Someone should build a little AI app that scrapes a job listing, then takes a resume and rewrites it in subtle ways to perfectly match the job description.

    Let your AI duke it out with their AI.

  • cheddar@programming.dev
    arrow-up
    65
    ·
    5 months ago

    One of my favorite examples is when a company from India (I think?) trained their model to regulate subway gates. The system was supposed to analyze footage and open more gates when there were more people, and vice versa. It worked well until one holiday when there were no people, but all gates were open. They eventually discovered that the system was looking at the clock visible on the video, rather than the number of people.

  • LeadersAtWork@lemmy.world
    arrow-up
    64
    arrow-down
    1
    ·
    edit-2
    5 months ago

    I do that shit when I have a web interview. Put up a guitar just visible in the camera, a small bookshelf, a floor lamp, make sure my tennis bag is visible despite not playing in ages…

    Whether they realize it or not, people do take this stuff in. Not sure why some algorithm based on these very same interviews wouldn’t do the same.

  • UltraGiGaGigantic@lemm.ee
    English
    arrow-up
    60
    arrow-down
    1
    ·
    edit-2
    5 months ago

    Recruiters: “people are using AI to apply! Shame on those lazy wage slaves!”

    Also recruiters:

  • SOB_Van_Owen@lemm.ee
    arrow-up
    53
    arrow-down
    1
    ·
    5 months ago

    One web LLM I was screwing around with had Job Interview as a preset. Ok. Played it totally straight the first time and had a totally positive outcome. Thought the interviewer way too agreeable. The next time I said the most inappropriate stuff I could imagine and still the interviewer agreed to come home with me to check out the rock collection I keep under my bed and listen to Captain Beefheart albums.

  • RoyaltyInTraining@lemmy.world
    arrow-up
    39
    arrow-down
    1
    ·
    5 months ago

    Why are the different scales connected? How exactly does one interpolate between agreeableness and neuroticism? This is the kind of diagram I used to draw as an 8 year old, and they put this crap in a real product…

    • maniclucky@lemmy.world
      arrow-up
      21
      ·
      5 months ago

      They shouldn’t be plotted that way technically. The big 5 are independent traits so they should essentially be sliders, not linked like that.

      That said, it’s way easier to see the points when you do that. Easy to miss when colors swap, for example, without the lines when you’ve been looking at this stuff for a few hours.

      • Pete@feddit.org
        arrow-up
        2
        ·
        5 months ago

        Yeah, it’s interesting that the math pretty much says that these factors are independent of each other. Then we did even fancier math with “AI”, all to ruin the base understanding by connecting them graphically. It bugs me more than it should. Think about your graphics. It is a very interesting result nevertheless.