The usage of the English Language can be used to discern LLM's and ChatGPT
The very usage of the English language can be used to discern if its ChatGPT output, one basic example is the usage of American English where in conversation people in Britain will often use the phrase "Can I talk to you" whereas in America they will often say "Can I talk **with** you".
ChatGPT and LLM's have mainly been trained on text from the English language so there is a bias towards those that use the English language in its training data.
Rarer languages from Africa for example that are used in tribes are not fed into ChatGPT, other languages such as French, German make up less of the training data for ChatGPT, as well people that are multi-lingual have a lot more intelligence when it comes to usage of words used when speaking with another person who is multilingual.
More research on my end is needed for Natural Language Processing because its a whole field of science and there are certain implications for multi-lingual folk because affecting spoken speech... not because of the non-existence microchips in the covid-19 vaccines, how well does ChatGPT really understand French, compared to English?
My guess is that because French is used less in the training data its not able to conceptualise the language as well as English and thus.
But also ChatGPT will used the same common words in English because it does not understand and can not vocalise the English language as well as say an English professor at Oxford.
ChatGPT is simply trained to predict the next word in a sequence and it will never truly grasp the deeper meanings behind certain words or phrases because its simply not human and can not take from the human experience.
It can reason and come up with fancy mathematical formulas, but it will never know what its like to be human and have a human experience.
Some ways mentioned to spot LLM output mentioned in the video linked. https://www.youtube.com/watch?v=d03Tww5n3bg
1. Its not this, its this - meant to covey something deep, but the more you think and consider the text, the more you learn that its just nonsense.
2. The rule of 3 - Example "The raw, the jagged, the kind of truth no one can discern"
3. It formats a question like the truth - The truth? The kicker? And honestly?
Comments
Post a Comment