{"id":1466,"date":"2026-01-27T11:57:43","date_gmt":"2026-01-27T11:57:43","guid":{"rendered":"https:\/\/brain.hr\/?p=1466"},"modified":"2026-02-02T13:57:52","modified_gmt":"2026-02-02T13:57:52","slug":"moze-li-ai-znati-sve-a-ne-znati-razgovarati-3-6","status":"publish","type":"post","link":"https:\/\/brain.hr\/en\/moze-li-ai-znati-sve-a-ne-znati-razgovarati-3-6\/","title":{"rendered":"Can AI know everything and not be able to talk? (3\/6)"},"content":{"rendered":"<p>The first and by far most expensive stage of creating an AI language model, which we call \u201cpre-training,\u201d does not produce the kind of useful assistant (like ChatGPT) we use today. The result of that first stage is the so-called base model.<\/p>\n\n\n\n<p>Experts like Andrej Karpathy describe this form of AI very simply: it is not a smart conversationalist, but just an \u201cextremely expensive text completion system.\u201d<\/p>\n\n\n\n<p>Although this model has \u201cread\u201d the entire Internet and possesses a vast amount of knowledge, it still cannot talk to humans at this stage.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\">The \u201cZip File\u201d of the Internet: Knowledge as Statistics<\/h3>\n\n\n\n<p>To understand the nature of the base model, Karpathy offers a brilliant analogy: imagine it as a compressed digital archive of the entire Internet. In its hundreds of billions of parameters, the model has condensed statistical patterns from an incredible 15 trillion basic language units (tokens).<\/p>\n\n\n\n<p>However, it is important to understand that this is<em>lossy compression.<\/em>The model does not memorize texts from the Internet verbatim, like a digital encyclopedia or database, but learns the style of expression. It does not store facts as fixed records, but as statistical probabilities. Therefore, it cannot \u201ccite\u201d the Internet exactly, but generates text that sounds like something that could be found online.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\">Why does the basic model not participate in a conversation?<\/h3>\n\n\n\n<p>The main reason why the basic language model cannot be a useful assistant is that it does not understand the concept of conversation. It is not programmed to help, but to continue the beginning of a series of words.<\/p>\n\n\n\n<p>If we ask the basic model a simple question, \u201cWhat is 2 + 2?\u201d it will most likely not answer \u201c4\u201d. Instead, it will continue the series as an online document would continue (e.g. a school quiz or knowledge test). It will probably generate new questions such as <em>\u201e<\/em>\u201cWhat is 3 + 3?<em>\u201c<\/em> \" or <em>\u201e<\/em>\u201cWhat is the definition of addition?<em>\u201c<\/em>\u201d. In the basic model, the question is not the beginning of a dialogue, but the beginning of a document that it tries to complete mathematically precisely by imitating the styles it has encountered during its learning.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\">From imitator to digital assistant<\/h3>\n\n\n\n<p>Despite possessing an unimaginable amount of information, the basic model is not usable in everyday work by itself. It has no \u201cpersonality\u201d, does not follow instructions and can often wander in unexpected directions because it only imitates samples from the Internet, including those that are less useful or incorrect.<\/p>\n\n\n\n<p>This insight is crucial for the education system because it explains the origin of so-called \u201challucinations\u201d. AI does not \u201clie\u201d intentionally, but simply tries to generate the most statistically likely continuation of a sentence. If that continuation sounds convincing, the model will print it, regardless of whether it is factually based.<\/p>\n\n\n\n<p>It is only in the second, much cheaper and shorter phase, that the simulator turns into a useful assistant that understands instructions and knows how to conduct a meaningful conversation.<\/p>\n\n\n\n<h3 class=\"wp-block-heading\">Read the previous articles in the series:<\/h3>\n\n\n\n<ol class=\"wp-block-list\">\n<li><a href=\"https:\/\/brain.hr\/en\/kako-nastaje-baza-znanja-kojom-se-sluzi-chatgpt\/\">How is the knowledge base used by ChatGPT created?<\/a><\/li>\n\n\n\n<li><a href=\"https:\/\/www.google.com\/search?q=https:\/\/brain.hr\/arhitektura-procesiranja-zasto-najpametniji-modeli-grijese-na-najjednostavnijim-zadacima\/\">Why do the \u201csmartest\u201d models make mistakes on the simplest tasks?<\/a><\/li>\n<\/ol>\n\n\n\n<p><strong>Source:<\/strong> An analysis of Andrej Karpathy\u2019s technical lecture: <a href=\"https:\/\/www.youtube.com\/watch?v=7xTGNNLPyMI\">Deep Dive into LLMs like ChatGPT<\/a>.<\/p>\n\n\n\n<p>U\u00a0sljede\u0107em nastavku pi\u0161emo o procesu post-treninga i saznajemo kako AI prolazi \u201e\u0161kolu lijepog pona\u0161anja\u201c kako bi od simulatora interneta postao digitalni asistent.<\/p>","protected":false},"excerpt":{"rendered":"<p>Prva i daleko najskuplja faza stvaranja jezi\u010dnog modela umjetne inteligencije, koju zovemo &#8220;pred-treniranje&#8221;, ne stvara onog korisnog asistenta (poput ChatGPT-a) kojeg danas koristimo. Rezultat te prve faze je takozvani osnovni model. Stru\u010dnjaci poput Andreja Karpathyja ovaj oblik AI-ja opisuju vrlo jednostavno: on nije pametni sugovornik, ve\u0107 samo \u201eiznimno skupi sustav za automatsko dovr\u0161avanje teksta\u201c. Iako [&hellip;]<\/p>\n","protected":false},"author":8,"featured_media":1468,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"_et_pb_use_builder":"off","_et_pb_old_content":"","_et_gb_content_width":"","footnotes":""},"categories":[14],"tags":[],"class_list":["post-1466","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-radovi"],"acf":{"radovi_source_url":"","radovi_button_label":"Pro\u010ditajte izvorni rad"},"_links":{"self":[{"href":"https:\/\/brain.hr\/en\/wp-json\/wp\/v2\/posts\/1466","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/brain.hr\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/brain.hr\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/brain.hr\/en\/wp-json\/wp\/v2\/users\/8"}],"replies":[{"embeddable":true,"href":"https:\/\/brain.hr\/en\/wp-json\/wp\/v2\/comments?post=1466"}],"version-history":[{"count":6,"href":"https:\/\/brain.hr\/en\/wp-json\/wp\/v2\/posts\/1466\/revisions"}],"predecessor-version":[{"id":1567,"href":"https:\/\/brain.hr\/en\/wp-json\/wp\/v2\/posts\/1466\/revisions\/1567"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/brain.hr\/en\/wp-json\/wp\/v2\/media\/1468"}],"wp:attachment":[{"href":"https:\/\/brain.hr\/en\/wp-json\/wp\/v2\/media?parent=1466"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/brain.hr\/en\/wp-json\/wp\/v2\/categories?post=1466"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/brain.hr\/en\/wp-json\/wp\/v2\/tags?post=1466"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}