ChatGPT gets code questions wrong 52% of the time

[ - ]

12 points

1 year ago

I believe this phenomenon is called “artificial hallucination”. It’s when a language model exceeds its training and makes info out of thin air. All language models have this flaw. Not just ChatGPT.

permalink

report

reply

[ - ]

☆ Yσɠƚԋσʂ ☆@lemmy.mlOP

14 points

1 year ago

The fundamental problem is that at the end of the day it’s just a glorified Markov chain. LLM doesn’t have any actual understanding of what it produces in a human sense, it just knows that particular sets of tokens tend to go together in the data it’s been trained on. GPT mechanic could very well be a useful building block for making learning systems, but a lot more work will need to be done before they can actually be said to understand anything in a meaningful way.

I suspect that to make a real AI we have to embody it in either a robot or a virtual avatar where it would learn to interact with its environment the way a child does. The AI has to build an internal representation of the physical world and its rules. Then we can teach it language using this common context where it would associate words with its understanding of the world. This kind of a shared context is essential for having AI understand things the way we do.

permalink

report

parent

reply

[ - ]

v_krishna@lemmy.ml

4 points

1 year ago

A lot of semantic NLP tried this and it kind of worked but meanwhile statistical correlation won out. It turns out while humans consider semantic understanding to be really important it actually isn’t required for an overwhelming majority of industry use cases. As a Kantian at heart (and an ML engineer by trade) it sucks to recognize this, but it seems like semantic conceptualization as an epiphenomenon emerging from statistical concurrence really might be the way that (at least artificial) intelligence works

permalink

report

parent

reply

[ - ]

☆ Yσɠƚԋσʂ ☆@lemmy.mlOP

2 points

1 year ago

I don’t see the approaches as mutually exclusive. Statistical correlation can get you pretty far, but we’re already seeing a lot of limitations with this approach when it comes to verifying correctness or having the algorithm explain how it came to a particular conclusion. In my view, this makes purely statistical approach inadequate for any situation where there is a specific result desired. For example, an autonomous vehicle has to drive on a road and correctly decide whether there are obstacles around it or not. Failing to do that correctly results in disastrous results and makes purely statistical approaches inherently unsafe.

I think things like GPT could be building blocks for systems that are trained to have semantic understanding. I think what it comes down to is simply training a statistical model against a physical environment until it adjusts its internal topology to create an internal model of the environment through experience. I don’t expect that semantic conceptualization will simply appear out of feeding a bunch of random data into a GPT style system though.

permalink

report

parent

reply

Show more comments

[ - ]

FunkyStuff [he/him]@hexbear.net

4 points

1 year ago

You have a pretty interesting idea that I hadn’t heard elsewhere. Do you know if there’s been any research to make an AI model learn that way?

In my own time while I’ve messed around with some ML stuff, I’ve heard of approaches where you try to get the model to accomplish progressively more complex tasks but in the same domain. For example, if you wanted to train a model to control an agent in a physics simulation to walk like a humanoid you’d have it learn to crawl first, like a real human. I guess for an AGI it makes sense that you would have it try to learn a model of the world across different domains like vision, or sound. Heck, since you can plug any kind of input to it you could have it process radio, infrared, whatever else. That way it could have a very complete model of the world.

permalink

report

parent

reply

[ - ]

☆ Yσɠƚԋσʂ ☆@lemmy.mlOP

-3 points

1 year ago

I’ve seen variations of this idea discussed in a few places, and there is a bunch of research happening around embodiment reinforcement training. A few prominent examples here

What you’re describing is very similar to stuff people have done with genetic algorithms where the agents evolve within the environment, the last link above focuses on that approach. I definitely think this is a really promising approach because we don’t have to figure out the algorithm that produces intelligence that way, but can wait for one to evolve instead.

And yeah, it’s going to be really exciting to plug AI models into different kinds of sensory data, it doesn’t even have to be physical world data. You could plug it into any stream of data that has temporal patterns in it, for example weather data, economic activity, or whatever and the AI will end up building a predictive model of it. I really think this is going to be the way forward for making actual AGI.

permalink

report

parent

reply

[ - ]

GBU_28@lemm.ee

2 points

1 year ago

People are working on this model vetts model, recursive action chain type stuff yea

permalink

report

parent

reply

[ - ]

s20@lemmy.ml

21 points

1 year ago

If I’m going to use AI for something, I want it to be right more often than I am, not just as often!

permalink

report

reply

[ - ]

space_comrade [he/him]@hexbear.net

14 points

1 year ago

It actually doesn’t have to be. For example the way I use Github Copilot is I give it a code snippet to generate and if it’s wrong I just write a bit more code and the it usually gets it right after 2-3 iterations and it still saves me time.

The trick is you should be able to quickly determine if the code is what you want which means you need to have a bit of experience under your belt, so AI is pretty useless if not actively harmful for junior devs.

Overall it’s a good tool if you can get your company to shell out $20 a month for it, not sure if I’d pay it out of my own pocket tho.

permalink

report

parent

reply

[ - ]

s20@lemmy.ml

11 points

1 year ago

It… it was a joke. I was implying that 52% was better than me.

permalink

report

parent

reply

[ - ]

space_comrade [he/him]@hexbear.net

9 points

1 year ago

Ah ok I guess I misread that. My point is that by itself it’s not gonna help you write either better or shittier code than you already do.

permalink

report

parent

reply

[ - ]

jvisick@programming.dev

8 points

1 year ago

GitHub Copilot is just intellisense that can complete longer code blocks.

I’ve found that it can somewhat regularly predict a couple lines of code that generally resemble what I was going to type, but it very rarely gives me correct completions. By a fairly wide margin, I end up needing to correct a piece or two. To your point, it can absolutely be detrimental to juniors or new learners by introducing bugs that are sometimes nastily subtle. I also find it getting in the way only a bit less frequently than it helps.

I do recommend that experienced developers give it a shot because it has been a helpful tool. But to be clear - it’s really only a tool that helps me type faster. By no means does it help me produce better code, and I don’t ever see it full on replacing developers like the doomsayers like to preach. That being said, I think it’s $20 well spent for a company in that it easily saves more than $20 worth of time from my salary each month.

permalink

report

parent

reply

[ - ]

Pleonasm@programming.dev

9 points

1 year ago

I was pretty impressed with it the other day, it converted ~150 lines of Python to C pretty flawlessly. I then asked it to extend the program by adding a progress bar to the program and that segfaulted, but it was immediately able to discover the segfault and fix it when I mentioned. Probably would have taken me an hour or two to write myself and ChatGPT did it in 5 minutes.

permalink

report

reply

[ - ]

possibly a cat@lemmy.ml

2 points

1 year ago

*

Deleted by creator

permalink

report

parent

reply

[ - ]

r00ty@kbin.life

15 points

1 year ago

I used ChatGPT once. It created non functional code. But, the general idea did help me get to where I wanted. Maybe it works better as a rubber duck substitute?

permalink

report

reply

[ - ]

☆ Yσɠƚԋσʂ ☆@lemmy.mlOP

0 points

1 year ago

Yeah, generating some ideas to get you going might be the best use for this kind of stuff.

permalink

report

parent

reply

[ - ]

WarmSoda@lemm.ee

6 points

1 year ago

*

That’s how I view AI generated art. It can come up with some really cool mash ups. But you have to do the rest. Anyone just using what it outputs like that’s the end of the story isn’t ‘using it right’ in my opinion.

permalink

report

parent

reply

[ - ]

☆ Yσɠƚԋσʂ ☆@lemmy.mlOP

-2 points

1 year ago

Right, I expect stuff like stable diffusion will become a part of the toolkit actual artists use. The workflows with this stuff are already getting pretty intricate where people use control net for posing, and inpainting of specific details, and so on. I would liken it to doing photography. You can’t just give a camera to anybody and get good results, it takes a person with a skill and taste to produce an interesting image.

permalink

report

parent

reply

[ - ]

EssentialCoffee@midwest.social

1 point

1 year ago

I’m not sure there’s a way to ‘use art right.’

report

reply

[ - ]

1 point

1 year ago

*

I did my first game jam with the help of chat gpt. It didn’t write any code in the game, but I was able to ask it how to accomplish certain things generally and it would give me ideas and it would be up to me to implement.

There were other things I knew my engine could do but i couldn’t figure out using the documentation, ao I would ask chat gpt “how do you xyz in godot” and it would give me step by step. This was especially useful for the things that get done in the engine ui and not in code.

permalink

report

parent

reply

[ - ]

GBU_28@lemm.ee

2 points

1 year ago

Use it as a boilerplate blaster, for shit you could write yourself

permalink

report

parent