コラム1 - masaki kurihara

第1回　AIは“記号のパターン”を見ている

AIは、知性があるように振る舞うことが得意なだけで、私たち人間のように「意味」を理解しているわけではありません。

AIの仕組みの根底にあるのは「統計モデル」と呼ばれる考え方です。統計とは、たくさんのデータの中にある規則性（パターン）を見つけ出す方法です。

AIが文章を書いたり、音楽を作ったり、絵を描いたりできるのは、それらすべてを記号の並びとして把握して、そのパターンを分析してマスターしているからです。

例えば「食べ物が美味しい」という自分の感情を誰かに伝えるとき、言葉の配置として「お菓子がすごく美味しい」と並べても「美味しいお菓子すごい」とは並べません。つまり、人間が作る文章や音楽には、一定のパターンという規則性があるのです。

AIは、言葉なら文字や単語の順番、音楽なら音の高さやリズム、絵なら形や色、光の強さなど、それぞれの記号がどのように並びやすいか、どんな関係があるかを確率として学びます。

このような膨大な確率関係を学習したAIは、自然な文を生成できるようになります。つまりAIは、「意味」ではなく「出現の傾向」を理解しているのです。

私たちも経験を通して、言葉のパターンなどを感覚として自然に身につけています。AIはそれを数学的に再現しているのです。ただし、両者の根本的な違いは、人間は「伝えたいこと」を伝えるために文章を作り、その意図と文章を比較して、正しく文が書けたか判断しますが、AIは意味ではなく、パターンの確率に合っているかで判断することです。

なお、この確率による予測の精度は、データの数の多さに強い影響を受けます。例えば、サイコロの出目は1/6ですが、6回振るだけでは、各値が均等には出ることはありません。しかし、偶然6が多く出た後には2が多いなどのブレによる影響は、サイコロを振る数が多ければ多いほど、ズレ同士が相殺され、本来の1/6に近づいていきます。このように言葉などの記号の傾向も、膨大なデータ数を取り扱うことで、どんどんあるべきパターンに近づいていきます。ほかにも仕掛けはありますが、現在は数兆単位でパターンの把握をして、確率の精度を上げています。そのため、現在のChatGPTは、データ量の多さを特徴とする大規模言語モデルといいます。

AIは“世界を記号として見る”機械です。言葉も音も形も、一定の規則を持つ限り、AIにとっては学習可能な対象になります。これが、AIが「創造」できるように見える理由なのです。

コラムトップに戻る

Part 1 – AI Sees Patterns of Symbols

AI is skilled at behaving as if it possesses intelligence,

but it does not “understand meaning” in the way that human beings do.

At the foundation of AI lies what is called a statistical model—

a method for discovering regularities, or patterns, hidden within vast amounts of data.

When AI writes text, composes music, or paints pictures,

it does so by recognizing each of these as a sequence of symbols

and by learning the patterns within those sequences.

For example, when we express the feeling “this food is delicious,”

we naturally say “the cake is really delicious,”

but not “delicious cake really.”

In other words, human language—and likewise music or art—

follows certain structural patterns and regularities.

AI learns these patterns as probabilities:

in language, the order of letters or words;

in music, the pitch and rhythm of sounds;

in images, the relationships among shapes, colors, and brightness.

Through statistical learning, it determines which combinations are likely to appear together.

Having learned an immense number of such probabilistic relationships,

AI can now generate sentences that sound natural to us.

In short, AI does not grasp meaning—

it grasps tendencies of occurrence.

We humans also acquire linguistic patterns through experience, almost intuitively.

AI simply reproduces this process mathematically.

However, there is a crucial difference:

humans create sentences to express an intention,

then evaluate whether their expression communicates that intention accurately.

AI, by contrast, judges its output only by whether it conforms to statistical probability—

not by any sense of meaning.

The accuracy of these probabilistic predictions depends heavily on the quantity of data.

Consider rolling a die: theoretically each number should appear one-sixth of the time,

but if you roll it only six times, the results will likely be uneven.

As the number of rolls increases, random fluctuations cancel each other out,

and the outcome converges toward the true probability of one-sixth.

In the same way, when AI processes language,

the more data it learns from,

the closer its predictions come to capturing the underlying patterns

of how words appear in human communication.

Today’s models handle datasets measured in trillions of tokens,

allowing them to achieve extraordinary precision.

For this reason, systems like ChatGPT are called Large Language Models (LLMs)—

models defined by the sheer scale of the data from which they learn.

AI, then, is a machine that perceives the world as symbols.

Words, sounds, images—anything with an underlying regularity—

can become a learnable pattern for AI.

And this, ultimately, is why AI sometimes appears to create:

not because it understands meaning,

but because it has mastered the art of reproducing the patterns of meaning’s surface.