{"id":744,"date":"2024-04-29T17:36:41","date_gmt":"2024-04-29T17:36:41","guid":{"rendered":"https:\/\/www.volucer.it\/?p=744"},"modified":"2024-04-29T17:39:52","modified_gmt":"2024-04-29T17:39:52","slug":"ai-in-action-basic-concepts-about-llm","status":"publish","type":"post","link":"https:\/\/www.volucer.it\/?p=744","title":{"rendered":"AI in Action: BASIC CONCEPTS ABOUT LLM"},"content":{"rendered":"\n<p><strong>LLMs - Large Language Models<\/strong><\/p>\n\n\n\n<p>LLMs are neural networks that can process and generate natural language text.<\/p>\n\n\n<div class=\"wp-block-image is-style-default\">\n<figure class=\"aligncenter size-full\"><a href=\"https:\/\/www.volucer.it\/wp-content\/uploads\/2024\/04\/LLM.png\"><img loading=\"lazy\" decoding=\"async\" width=\"806\" height=\"682\" src=\"https:\/\/www.volucer.it\/wp-content\/uploads\/2024\/04\/LLM.png\" alt=\"LLMs are neural networks that can process and generate natural language text.\" class=\"wp-image-746\" title=\"LLMs are neural networks that can process and generate natural language text.\" srcset=\"https:\/\/www.volucer.it\/wp-content\/uploads\/2024\/04\/LLM.png 806w, https:\/\/www.volucer.it\/wp-content\/uploads\/2024\/04\/LLM-300x254.png 300w, https:\/\/www.volucer.it\/wp-content\/uploads\/2024\/04\/LLM-768x650.png 768w\" sizes=\"auto, (max-width: 806px) 100vw, 806px\" \/><\/a><figcaption class=\"wp-element-caption\">Image generated with Midjourney from the prompt \u201cLLMs are neural networks that can process and generate natural language text.\u201d<\/figcaption><\/figure>\n<\/div>\n\n\n<p><\/p>\n\n\n\n<p><strong>TRAINING PHASE<\/strong><\/p>\n\n\n\n<p>LLMs are trained on datasets of billions of sentences using unsupervised learning techniques. 
In the <strong>training process<\/strong>, LLMs learn from a huge amount of data which word is most likely to come next, given the words that precede it.<\/p>\n\n\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-full is-resized\"><a href=\"https:\/\/www.volucer.it\/wp-content\/uploads\/2024\/04\/LLM_Training.png\"><img loading=\"lazy\" decoding=\"async\" width=\"587\" height=\"382\" src=\"https:\/\/www.volucer.it\/wp-content\/uploads\/2024\/04\/LLM_Training.png\" alt=\"\" class=\"wp-image-749\" style=\"width:731px;height:auto\" srcset=\"https:\/\/www.volucer.it\/wp-content\/uploads\/2024\/04\/LLM_Training.png 587w, https:\/\/www.volucer.it\/wp-content\/uploads\/2024\/04\/LLM_Training-300x195.png 300w\" sizes=\"auto, (max-width: 587px) 100vw, 587px\" \/><\/a><\/figure>\n<\/div>\n\n\n<p><strong>INPUT BY USER<\/strong><\/p>\n\n\n\n<p>LLMs accept a <strong>text prompt<\/strong> from a user as input and, based on it, <strong>generate<\/strong> output text word by word (token by token).<\/p>\n\n\n\n<p><strong>GENERATION OF THE OUTPUT<\/strong><\/p>\n\n\n\n<p>The <strong>generation process<\/strong> consists of predicting the next word on the basis of the prompt and the previously generated words. LLMs are trained to do this without any consciousness, which is a prerogative of the human mind.<\/p>\n\n\n\n<p><\/p>\n\n\n\n<h2 class=\"wp-block-heading has-vivid-cyan-blue-color has-text-color has-link-color wp-elements-dbb2a1719cf02fa807af7a0747f3f889\"><strong>Building a Simple Large Language Model<\/strong><\/h2>\n\n\n\n<p>In this example we use as data the dystopian novel \u201c<em>Nineteen Eighty-Four \u2013 <\/em>1984\u201d by the English writer George Orwell, published in 1949.<\/p>\n\n\n\n<p>Using the text of the novel as a data source, the following tables were produced. 
I show only a part of them:<\/p>\n\n\n\n<figure class=\"wp-block-table\"><table><tbody><tr><td class=\"has-text-align-center\" data-align=\"center\"><strong>Word\/Token<\/strong><\/td><td class=\"has-text-align-center\" data-align=\"center\"><strong>Occurrences<\/strong><\/td><\/tr><tr><td class=\"has-text-align-center\" data-align=\"center\"><strong>the<\/strong><\/td><td class=\"has-text-align-center\" data-align=\"center\">6249<\/td><\/tr><tr><td class=\"has-text-align-center\" data-align=\"center\"><strong>of<\/strong><\/td><td class=\"has-text-align-center\" data-align=\"center\">3309<\/td><\/tr><tr><td class=\"has-text-align-center\" data-align=\"center\"><strong>a<\/strong><\/td><td class=\"has-text-align-center\" data-align=\"center\">2482<\/td><\/tr><tr><td class=\"has-text-align-center\" data-align=\"center\"><strong>and<\/strong><\/td><td class=\"has-text-align-center\" data-align=\"center\">2326<\/td><\/tr><tr><td class=\"has-text-align-center\" data-align=\"center\"><strong>to<\/strong><\/td><td class=\"has-text-align-center\" data-align=\"center\">2236<\/td><\/tr><tr><td class=\"has-text-align-center\" data-align=\"center\"><strong>was<\/strong><\/td><td class=\"has-text-align-center\" data-align=\"center\">2213<\/td><\/tr><tr><td class=\"has-text-align-center\" data-align=\"center\"><strong>He<\/strong><\/td><td class=\"has-text-align-center\" data-align=\"center\">1959<\/td><\/tr><tr><td class=\"has-text-align-center\" data-align=\"center\"><strong>It<\/strong><\/td><td class=\"has-text-align-center\" data-align=\"center\">1864<\/td><\/tr><tr><td class=\"has-text-align-center\" data-align=\"center\"><strong>in<\/strong><\/td><td class=\"has-text-align-center\" data-align=\"center\">1759<\/td><\/tr><tr><td class=\"has-text-align-center\" data-align=\"center\"><strong>that<\/strong><\/td><td class=\"has-text-align-center\" data-align=\"center\">1457<\/td><\/tr><tr><td class=\"has-text-align-center\" data-align=\"center\"><strong>had<\/strong><\/td><td 
class=\"has-text-align-center\" data-align=\"center\">1311<\/td><\/tr><tr><td class=\"has-text-align-center\" data-align=\"center\"><strong>his<\/strong><\/td><td class=\"has-text-align-center\" data-align=\"center\">1079<\/td><\/tr><tr><td class=\"has-text-align-center\" data-align=\"center\"><strong>you<\/strong><\/td><td class=\"has-text-align-center\" data-align=\"center\">1011<\/td><\/tr><tr><td class=\"has-text-align-center\" data-align=\"center\"><strong>not<\/strong><\/td><td class=\"has-text-align-center\" data-align=\"center\">827<\/td><\/tr><tr><td class=\"has-text-align-center\" data-align=\"center\"><strong>with<\/strong><\/td><td class=\"has-text-align-center\" data-align=\"center\">771<\/td><\/tr><tr><td class=\"has-text-align-center\" data-align=\"center\"><strong>as<\/strong><\/td><td class=\"has-text-align-center\" data-align=\"center\">672<\/td><\/tr><tr><td class=\"has-text-align-center\" data-align=\"center\"><strong>At<\/strong><\/td><td class=\"has-text-align-center\" data-align=\"center\">654<\/td><\/tr><tr><td class=\"has-text-align-center\" data-align=\"center\"><strong>they<\/strong><\/td><td class=\"has-text-align-center\" data-align=\"center\">642<\/td><\/tr><tr><td class=\"has-text-align-center\" data-align=\"center\"><strong>for<\/strong><\/td><td class=\"has-text-align-center\" data-align=\"center\">615<\/td><\/tr><tr><td class=\"has-text-align-center\" data-align=\"center\"><strong>IS<\/strong><\/td><td class=\"has-text-align-center\" data-align=\"center\">614<\/td><\/tr><tr><td class=\"has-text-align-center\" data-align=\"center\"><strong>but<\/strong><\/td><td class=\"has-text-align-center\" data-align=\"center\">611<\/td><\/tr><tr><td class=\"has-text-align-center\" data-align=\"center\"><strong>be<\/strong><\/td><td class=\"has-text-align-center\" data-align=\"center\">608<\/td><\/tr><tr><td class=\"has-text-align-center\" data-align=\"center\"><strong>on<\/strong><\/td><td class=\"has-text-align-center\" 
data-align=\"center\">604<\/td><\/tr><tr><td class=\"has-text-align-center\" data-align=\"center\"><strong>were<\/strong><\/td><td class=\"has-text-align-center\" data-align=\"center\">583<\/td><\/tr><tr><td class=\"has-text-align-center\" data-align=\"center\"><strong>there<\/strong><\/td><td class=\"has-text-align-center\" data-align=\"center\">559<\/td><\/tr><tr><td class=\"has-text-align-center\" data-align=\"center\"><strong>Winston<\/strong><\/td><td class=\"has-text-align-center\" data-align=\"center\">526<\/td><\/tr><tr><td class=\"has-text-align-center\" data-align=\"center\"><strong>him<\/strong><\/td><td class=\"has-text-align-center\" data-align=\"center\">512<\/td><\/tr><tr><td class=\"has-text-align-center\" data-align=\"center\"><strong>i<\/strong><\/td><td class=\"has-text-align-center\" data-align=\"center\">495<\/td><\/tr><tr><td class=\"has-text-align-center\" data-align=\"center\"><strong>which<\/strong><\/td><td class=\"has-text-align-center\" data-align=\"center\">443<\/td><\/tr><tr><td class=\"has-text-align-center\" data-align=\"center\"><strong>s<\/strong><\/td><td class=\"has-text-align-center\" data-align=\"center\">439<\/td><\/tr><tr><td class=\"has-text-align-center\" data-align=\"center\"><strong>one<\/strong><\/td><td class=\"has-text-align-center\" data-align=\"center\">426<\/td><\/tr><tr><td class=\"has-text-align-center\" data-align=\"center\"><strong>or<\/strong><\/td><td class=\"has-text-align-center\" data-align=\"center\">424<\/td><\/tr><tr><td class=\"has-text-align-center\" data-align=\"center\">\u2026<\/td><td class=\"has-text-align-center\" data-align=\"center\">\u2026<\/td><\/tr><\/tbody><\/table><\/figure>\n\n\n\n<figure class=\"wp-block-table\"><table><tbody><tr><td class=\"has-text-align-center\" data-align=\"center\"><strong>Word\/Token<\/strong><\/td><td class=\"has-text-align-center\" data-align=\"center\"><strong>Word Next<\/strong><\/td><td class=\"has-text-align-center\" 
data-align=\"center\"><strong>Score<\/strong><\/td><td class=\"has-text-align-center\" data-align=\"center\"><strong>Probability<\/strong><\/td><\/tr><tr><td class=\"has-text-align-center\" data-align=\"center\">of<\/td><td class=\"has-text-align-center\" data-align=\"center\">the<\/td><td class=\"has-text-align-center\" data-align=\"center\">743<\/td><td class=\"has-text-align-center\" data-align=\"center\">0,01139<\/td><\/tr><tr><td class=\"has-text-align-center\" data-align=\"center\">It<\/td><td class=\"has-text-align-center\" data-align=\"center\">was<\/td><td class=\"has-text-align-center\" data-align=\"center\">589<\/td><td class=\"has-text-align-center\" data-align=\"center\">0,00903<\/td><\/tr><tr><td class=\"has-text-align-center\" data-align=\"center\">in<\/td><td class=\"has-text-align-center\" data-align=\"center\">the<\/td><td class=\"has-text-align-center\" data-align=\"center\">574<\/td><td class=\"has-text-align-center\" data-align=\"center\">0,00880<\/td><\/tr><tr><td class=\"has-text-align-center\" data-align=\"center\">He<\/td><td class=\"has-text-align-center\" data-align=\"center\">had<\/td><td class=\"has-text-align-center\" data-align=\"center\">355<\/td><td class=\"has-text-align-center\" data-align=\"center\">0,00544<\/td><\/tr><tr><td class=\"has-text-align-center\" data-align=\"center\">he<\/td><td class=\"has-text-align-center\" data-align=\"center\">was<\/td><td class=\"has-text-align-center\" data-align=\"center\">273<\/td><td class=\"has-text-align-center\" data-align=\"center\">0,00418<\/td><\/tr><tr><td class=\"has-text-align-center\" data-align=\"center\">on<\/td><td class=\"has-text-align-center\" data-align=\"center\">the<\/td><td class=\"has-text-align-center\" data-align=\"center\">230<\/td><td class=\"has-text-align-center\" data-align=\"center\">0,00352<\/td><\/tr><tr><td class=\"has-text-align-center\" data-align=\"center\">was<\/td><td class=\"has-text-align-center\" data-align=\"center\">a<\/td><td 
class=\"has-text-align-center\" data-align=\"center\">225<\/td><td class=\"has-text-align-center\" data-align=\"center\">0,00345<\/td><\/tr><tr><td class=\"has-text-align-center\" data-align=\"center\">there<\/td><td class=\"has-text-align-center\" data-align=\"center\">was<\/td><td class=\"has-text-align-center\" data-align=\"center\">223<\/td><td class=\"has-text-align-center\" data-align=\"center\">0,00342<\/td><\/tr><tr><td class=\"has-text-align-center\" data-align=\"center\">to<\/td><td class=\"has-text-align-center\" data-align=\"center\">the<\/td><td class=\"has-text-align-center\" data-align=\"center\">212<\/td><td class=\"has-text-align-center\" data-align=\"center\">0,00325<\/td><\/tr><tr><td class=\"has-text-align-center\" data-align=\"center\">O<\/td><td class=\"has-text-align-center\" data-align=\"center\">Brien<\/td><td class=\"has-text-align-center\" data-align=\"center\">205<\/td><td class=\"has-text-align-center\" data-align=\"center\">0,00314<\/td><\/tr><tr><td class=\"has-text-align-center\" data-align=\"center\">to<\/td><td class=\"has-text-align-center\" data-align=\"center\">be<\/td><td class=\"has-text-align-center\" data-align=\"center\">203<\/td><td class=\"has-text-align-center\" data-align=\"center\">0,00311<\/td><\/tr><tr><td class=\"has-text-align-center\" data-align=\"center\">and<\/td><td class=\"has-text-align-center\" data-align=\"center\">the<\/td><td class=\"has-text-align-center\" data-align=\"center\">203<\/td><td class=\"has-text-align-center\" data-align=\"center\">0,00311<\/td><\/tr><tr><td class=\"has-text-align-center\" data-align=\"center\">had<\/td><td class=\"has-text-align-center\" data-align=\"center\">been<\/td><td class=\"has-text-align-center\" data-align=\"center\">202<\/td><td class=\"has-text-align-center\" data-align=\"center\">0,00310<\/td><\/tr><tr><td class=\"has-text-align-center\" data-align=\"center\">the<\/td><td class=\"has-text-align-center\" data-align=\"center\">party<\/td><td 
class=\"has-text-align-center\" data-align=\"center\">195<\/td><td class=\"has-text-align-center\" data-align=\"center\">0,00299<\/td><\/tr><tr><td class=\"has-text-align-center\" data-align=\"center\">at<\/td><td class=\"has-text-align-center\" data-align=\"center\">the<\/td><td class=\"has-text-align-center\" data-align=\"center\">183<\/td><td class=\"has-text-align-center\" data-align=\"center\">0,00280<\/td><\/tr><tr><td class=\"has-text-align-center\" data-align=\"center\">that<\/td><td class=\"has-text-align-center\" data-align=\"center\">he<\/td><td class=\"has-text-align-center\" data-align=\"center\">167<\/td><td class=\"has-text-align-center\" data-align=\"center\">0,00256<\/td><\/tr><tr><td class=\"has-text-align-center\" data-align=\"center\">from<\/td><td class=\"has-text-align-center\" data-align=\"center\">the<\/td><td class=\"has-text-align-center\" data-align=\"center\">161<\/td><td class=\"has-text-align-center\" data-align=\"center\">0,00247<\/td><\/tr><tr><td class=\"has-text-align-center\" data-align=\"center\">with<\/td><td class=\"has-text-align-center\" data-align=\"center\">a<\/td><td class=\"has-text-align-center\" data-align=\"center\">158<\/td><td class=\"has-text-align-center\" data-align=\"center\">0,00242<\/td><\/tr><tr><td class=\"has-text-align-center\" data-align=\"center\">did<\/td><td class=\"has-text-align-center\" data-align=\"center\">not<\/td><td class=\"has-text-align-center\" data-align=\"center\">148<\/td><td class=\"has-text-align-center\" data-align=\"center\">0,00227<\/td><\/tr><tr><td class=\"has-text-align-center\" data-align=\"center\">that<\/td><td class=\"has-text-align-center\" data-align=\"center\">the<\/td><td class=\"has-text-align-center\" data-align=\"center\">147<\/td><td class=\"has-text-align-center\" data-align=\"center\">0,00225<\/td><\/tr><tr><td class=\"has-text-align-center\" data-align=\"center\">of<\/td><td class=\"has-text-align-center\" data-align=\"center\">a<\/td><td 
class=\"has-text-align-center\" data-align=\"center\">145<\/td><td class=\"has-text-align-center\" data-align=\"center\">0,00222<\/td><\/tr><tr><td class=\"has-text-align-center\" data-align=\"center\">of<\/td><td class=\"has-text-align-center\" data-align=\"center\">his<\/td><td class=\"has-text-align-center\" data-align=\"center\">145<\/td><td class=\"has-text-align-center\" data-align=\"center\">0,00222<\/td><\/tr><tr><td class=\"has-text-align-center\" data-align=\"center\">out<\/td><td class=\"has-text-align-center\" data-align=\"center\">of<\/td><td class=\"has-text-align-center\" data-align=\"center\">142<\/td><td class=\"has-text-align-center\" data-align=\"center\">0,00218<\/td><\/tr><tr><td class=\"has-text-align-center\" data-align=\"center\">was<\/td><td class=\"has-text-align-center\" data-align=\"center\">not<\/td><td class=\"has-text-align-center\" data-align=\"center\">130<\/td><td class=\"has-text-align-center\" data-align=\"center\">0,00199<\/td><\/tr><tr><td class=\"has-text-align-center\" data-align=\"center\">with<\/td><td class=\"has-text-align-center\" data-align=\"center\">the<\/td><td class=\"has-text-align-center\" data-align=\"center\">127<\/td><td class=\"has-text-align-center\" data-align=\"center\">0,00195<\/td><\/tr><tr><td class=\"has-text-align-center\" data-align=\"center\">he<\/td><td class=\"has-text-align-center\" data-align=\"center\">could<\/td><td class=\"has-text-align-center\" data-align=\"center\">124<\/td><td class=\"has-text-align-center\" data-align=\"center\">0,00190<\/td><\/tr><tr><td class=\"has-text-align-center\" data-align=\"center\">it<\/td><td class=\"has-text-align-center\" data-align=\"center\">is<\/td><td class=\"has-text-align-center\" data-align=\"center\">124<\/td><td class=\"has-text-align-center\" data-align=\"center\">0,00190<\/td><\/tr><tr><td class=\"has-text-align-center\" data-align=\"center\">in<\/td><td class=\"has-text-align-center\" data-align=\"center\">his<\/td><td 
class=\"has-text-align-center\" data-align=\"center\">123<\/td><td class=\"has-text-align-center\" data-align=\"center\">0,00188<\/td><\/tr><tr><td class=\"has-text-align-center\" data-align=\"center\">in<\/td><td class=\"has-text-align-center\" data-align=\"center\">a<\/td><td class=\"has-text-align-center\" data-align=\"center\">122<\/td><td class=\"has-text-align-center\" data-align=\"center\">0,00187<\/td><\/tr><tr><td class=\"has-text-align-center\" data-align=\"center\">They<\/td><td class=\"has-text-align-center\" data-align=\"center\">were<\/td><td class=\"has-text-align-center\" data-align=\"center\">122<\/td><td class=\"has-text-align-center\" data-align=\"center\">0,00187<\/td><\/tr><tr><td class=\"has-text-align-center\" data-align=\"center\">seemed<\/td><td class=\"has-text-align-center\" data-align=\"center\">to<\/td><td class=\"has-text-align-center\" data-align=\"center\">115<\/td><td class=\"has-text-align-center\" data-align=\"center\">0,00176<\/td><\/tr><tr><td class=\"has-text-align-center\" data-align=\"center\">was<\/td><td class=\"has-text-align-center\" data-align=\"center\">the<\/td><td class=\"has-text-align-center\" data-align=\"center\">110<\/td><td class=\"has-text-align-center\" data-align=\"center\">0,00169<\/td><\/tr><tr><td class=\"has-text-align-center\" data-align=\"center\">could<\/td><td class=\"has-text-align-center\" data-align=\"center\">not<\/td><td class=\"has-text-align-center\" data-align=\"center\">109<\/td><td class=\"has-text-align-center\" data-align=\"center\">0,00167<\/td><\/tr><tr><td class=\"has-text-align-center\" data-align=\"center\">he<\/td><td class=\"has-text-align-center\" data-align=\"center\">said<\/td><td class=\"has-text-align-center\" data-align=\"center\">109<\/td><td class=\"has-text-align-center\" data-align=\"center\">0,00167<\/td><\/tr><tr><td class=\"has-text-align-center\" data-align=\"center\">the<\/td><td class=\"has-text-align-center\" data-align=\"center\">same<\/td><td 
class=\"has-text-align-center\" data-align=\"center\">103<\/td><td class=\"has-text-align-center\" data-align=\"center\">0,00158<\/td><\/tr><tr><td class=\"has-text-align-center\" data-align=\"center\">for<\/td><td class=\"has-text-align-center\" data-align=\"center\">the<\/td><td class=\"has-text-align-center\" data-align=\"center\">101<\/td><td class=\"has-text-align-center\" data-align=\"center\">0,00155<\/td><\/tr><tr><td class=\"has-text-align-center\" data-align=\"center\">by<\/td><td class=\"has-text-align-center\" data-align=\"center\">the<\/td><td class=\"has-text-align-center\" data-align=\"center\">95<\/td><td class=\"has-text-align-center\" data-align=\"center\">0,00146<\/td><\/tr><tr><td class=\"has-text-align-center\" data-align=\"center\">for<\/td><td class=\"has-text-align-center\" data-align=\"center\">a<\/td><td class=\"has-text-align-center\" data-align=\"center\">92<\/td><td class=\"has-text-align-center\" data-align=\"center\">0,00141<\/td><\/tr><tr><td class=\"has-text-align-center\" data-align=\"center\">into<\/td><td class=\"has-text-align-center\" data-align=\"center\">the<\/td><td class=\"has-text-align-center\" data-align=\"center\">92<\/td><td class=\"has-text-align-center\" data-align=\"center\">0,00141<\/td><\/tr><tr><td class=\"has-text-align-center\" data-align=\"center\">she<\/td><td class=\"has-text-align-center\" data-align=\"center\">had<\/td><td class=\"has-text-align-center\" data-align=\"center\">87<\/td><td class=\"has-text-align-center\" data-align=\"center\">0,00133<\/td><\/tr><tr><td class=\"has-text-align-center\" data-align=\"center\">as<\/td><td class=\"has-text-align-center\" data-align=\"center\">though<\/td><td class=\"has-text-align-center\" data-align=\"center\">82<\/td><td class=\"has-text-align-center\" data-align=\"center\">0,00126<\/td><\/tr><tr><td class=\"has-text-align-center\" data-align=\"center\">they<\/td><td class=\"has-text-align-center\" data-align=\"center\">had<\/td><td 
class=\"has-text-align-center\" data-align=\"center\">80<\/td><td class=\"has-text-align-center\" data-align=\"center\">0,00123<\/td><\/tr><tr><td class=\"has-text-align-center\" data-align=\"center\">that<\/td><td class=\"has-text-align-center\" data-align=\"center\">it<\/td><td class=\"has-text-align-center\" data-align=\"center\">80<\/td><td class=\"has-text-align-center\" data-align=\"center\">0,00123<\/td><\/tr><tr><td class=\"has-text-align-center\" data-align=\"center\">have<\/td><td class=\"has-text-align-center\" data-align=\"center\">been<\/td><td class=\"has-text-align-center\" data-align=\"center\">79<\/td><td class=\"has-text-align-center\" data-align=\"center\">0,00121<\/td><\/tr><tr><td class=\"has-text-align-center\" data-align=\"center\">and<\/td><td class=\"has-text-align-center\" data-align=\"center\">a<\/td><td class=\"has-text-align-center\" data-align=\"center\">78<\/td><td class=\"has-text-align-center\" data-align=\"center\">0,00120<\/td><\/tr><tr><td class=\"has-text-align-center\" data-align=\"center\">it<\/td><td class=\"has-text-align-center\" data-align=\"center\">had<\/td><td class=\"has-text-align-center\" data-align=\"center\">77<\/td><td class=\"has-text-align-center\" data-align=\"center\">0,00118<\/td><\/tr><tr><td class=\"has-text-align-center\" data-align=\"center\">The<\/td><td class=\"has-text-align-center\" data-align=\"center\">other<\/td><td class=\"has-text-align-center\" data-align=\"center\">76<\/td><td class=\"has-text-align-center\" data-align=\"center\">0,00116<\/td><\/tr><tr><td class=\"has-text-align-center\" data-align=\"center\">of<\/td><td class=\"has-text-align-center\" data-align=\"center\">them<\/td><td class=\"has-text-align-center\" data-align=\"center\">76<\/td><td class=\"has-text-align-center\" data-align=\"center\">0,00116<\/td><\/tr><tr><td class=\"has-text-align-center\" data-align=\"center\">to<\/td><td class=\"has-text-align-center\" data-align=\"center\">him<\/td><td class=\"has-text-align-center\" 
data-align=\"center\">75<\/td><td class=\"has-text-align-center\" data-align=\"center\">0,00115<\/td><\/tr><tr><td class=\"has-text-align-center\" data-align=\"center\">the<\/td><td class=\"has-text-align-center\" data-align=\"center\">telescreen<\/td><td class=\"has-text-align-center\" data-align=\"center\">75<\/td><td class=\"has-text-align-center\" data-align=\"center\">0,00115<\/td><\/tr><tr><td class=\"has-text-align-center\" data-align=\"center\">BIG<\/td><td class=\"has-text-align-center\" data-align=\"center\">BROTHER<\/td><td class=\"has-text-align-center\" data-align=\"center\">73<\/td><td class=\"has-text-align-center\" data-align=\"center\">0,00112<\/td><\/tr><tr><td class=\"has-text-align-center\" data-align=\"center\">\u2026<\/td><td class=\"has-text-align-center\" data-align=\"center\">\u2026<\/td><td class=\"has-text-align-center\" data-align=\"center\">\u2026<\/td><td class=\"has-text-align-center\" data-align=\"center\">\u2026<\/td><\/tr><\/tbody><\/table><\/figure>\n\n\n\n<p>The following simple table shows how the text is generated word by word.<\/p>\n\n\n\n<p>For example, if we start with BIG, the model will probably generate BROTHER; continuing in the same way, we can produce this sentence:<\/p>\n\n\n\n<figure class=\"wp-block-table\"><table><tbody><tr><td class=\"has-text-align-center\" data-align=\"center\"><strong>BIG<\/strong><\/td><td class=\"has-text-align-center\" data-align=\"center\"><strong>BROTHER<\/strong><\/td><td class=\"has-text-align-center\" data-align=\"center\"><strong>was<\/strong><\/td><td class=\"has-text-align-center\" data-align=\"center\"><strong>a<\/strong><\/td><td class=\"has-text-align-center\" data-align=\"center\"><strong>sort<\/strong><\/td><td class=\"has-text-align-center\" data-align=\"center\"><strong>of<\/strong><\/td><td class=\"has-text-align-center\" data-align=\"center\"><strong>the<\/strong><\/td><td><strong>thought<\/strong><\/td><td>\u2026<\/td><\/tr><tr><td class=\"has-text-align-center\" 
data-align=\"center\">Probability<\/td><td class=\"has-text-align-center\" data-align=\"center\">0,00112<\/td><td class=\"has-text-align-center\" data-align=\"center\">0,0005<\/td><td class=\"has-text-align-center\" data-align=\"center\">0,00345<\/td><td class=\"has-text-align-center\" data-align=\"center\">0,00095<\/td><td class=\"has-text-align-center\" data-align=\"center\">0,00104<\/td><td class=\"has-text-align-center\" data-align=\"center\">0,00023<\/td><td>0,00087<\/td><td><br><\/td><\/tr><\/tbody><\/table><\/figure>\n\n\n\n<p><\/p>\n\n\n\n<h2 class=\"wp-block-heading has-vivid-cyan-blue-color has-text-color has-link-color wp-elements-0b8ae84c35d164e536b61f94549b6cad\"><strong>ChatGPT is an LLM<\/strong><\/h2>\n\n\n\n<p>Using the \u201c<strong>prompt<\/strong>\u201d mechanism, you can ask ChatGPT for what you want in natural language.<\/p>\n\n\n\n<p>But how does ChatGPT \u201cUNDERSTAND\u201d the text entered by the user?<\/p>\n\n\n\n<p>The text is transformed so that each word is represented by a numeric code that a computer can process.<\/p>\n\n\n\n<p>One way to represent individual words is the <strong>Word2Vec<\/strong> technique from natural language processing (NLP), in which each word is represented by a vector (an ordered list of numbers). This helps a computer to assign a meaning to the word.<\/p>\n\n\n\n<p><strong>Word2Vec<\/strong> stands for \u201c<strong>word to vector<\/strong>\u201d. It means expressing each word in your text corpus as a point in an n-dimensional space. 
The word\u2019s value along each dimension is what defines it for the model.<\/p>\n\n\n\n<p>The meaning of a word is derived from its context, that is, from the neighboring words it usually appears with.<\/p>\n\n\n\n<p>Here is a simple example of word representation using the Word2Vec approach in two-dimensional space:<\/p>\n\n\n\n<p>Man = [1,4]<\/p>\n\n\n\n<p>Woman = [1,3]<\/p>\n\n\n\n<p>Manager = [4,2]<\/p>\n\n\n\n<p>Actress = [4,1]<\/p>\n\n\n\n<figure class=\"wp-block-table\"><table><tbody><tr><td class=\"has-text-align-center\" data-align=\"center\"><strong>Manager<\/strong><\/td><td class=\"has-text-align-center\" data-align=\"center\" rowspan=\"2\"><strong>-<\/strong><\/td><td class=\"has-text-align-center\" data-align=\"center\"><strong>Man<\/strong><\/td><td class=\"has-text-align-center\" data-align=\"center\" rowspan=\"2\"><strong>+<\/strong><\/td><td class=\"has-text-align-center\" data-align=\"center\"><strong>Woman<\/strong><\/td><td class=\"has-text-align-center\" data-align=\"center\" rowspan=\"2\"><strong>=<\/strong><\/td><td class=\"has-text-align-center\" data-align=\"center\"><strong>Actress<\/strong><\/td><\/tr><tr><td class=\"has-text-align-center\" data-align=\"center\">[4,2]<\/td><td class=\"has-text-align-center\" data-align=\"center\">[1,4]<\/td><td class=\"has-text-align-center\" data-align=\"center\">[1,3]<\/td><td class=\"has-text-align-center\" data-align=\"center\">[4,1]<\/td><\/tr><\/tbody><\/table><figcaption class=\"wp-element-caption\"><br><br>The sum is computed component by component: 4 - 1 + 1 = 4 and 2 - 4 + 3 = 1. 
The following picture shows the graphical representation.<br><\/figcaption><\/figure>\n\n\n\n<p><\/p>\n\n\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-full is-resized\"><a href=\"https:\/\/www.volucer.it\/wp-content\/uploads\/2024\/04\/WordSpace01.png\"><img loading=\"lazy\" decoding=\"async\" width=\"626\" height=\"345\" src=\"https:\/\/www.volucer.it\/wp-content\/uploads\/2024\/04\/WordSpace01.png\" alt=\"\" class=\"wp-image-758\" style=\"width:856px;height:auto\" 
srcset=\"https:\/\/www.volucer.it\/wp-content\/uploads\/2024\/04\/WordSpace01.png 626w, https:\/\/www.volucer.it\/wp-content\/uploads\/2024\/04\/WordSpace01-300x165.png 300w\" sizes=\"auto, (max-width: 626px) 100vw, 626px\" \/><\/a><\/figure>\n<\/div>\n\n\n<p>This is what happens when you send a prompt to ChatGPT.<\/p>\n\n\n\n<ol class=\"wp-block-list\" style=\"list-style-type:lower-alpha\">\n<li>The text is split into tokens and each token is converted into a vector;<\/li>\n<\/ol>\n\n\n\n<p class=\"has-text-align-center\">[10,10], [10,31], [10,15], [14,44], [8,5], \u2026<\/p>\n\n\n\n<p class=\"has-text-align-center\">(you, are, an, ICT, specialist, with, a, lot, of, experience)<\/p>\n\n\n\n<ol class=\"wp-block-list\" start=\"2\" style=\"list-style-type:lower-alpha\">\n<li>An algorithm (like ChatGPT) makes predictions and outputs text word by word.<\/li>\n<\/ol>\n\n\n\n<p class=\"has-text-align-center\">[10,10],\u2026<\/p>\n\n\n\n<p class=\"has-text-align-center\">(you, can, have, an, important, and, well-paid, job)<\/p>\n\n\n\n<p><\/p>\n\n\n\n<p><\/p>\n\n\n\n<h2 class=\"wp-block-heading has-vivid-cyan-blue-color has-text-color has-link-color wp-elements-530b0cfab4cac4084473e7b43541f3c4\"><strong>ChatGPT and PROMPT ENGINEERING<\/strong><\/h2>\n\n\n\n<p>Let us now analyze some techniques to better exploit the potential of ChatGPT.<\/p>\n\n\n\n<p><strong>DIRECTIONAL PROMPTING<\/strong><\/p>\n\n\n\n<p>If you submit the same question to ChatGPT many times, you will likely receive different answers.<\/p>\n\n\n\n<p>How can you use directional prompting to get a more precise answer?<\/p>\n\n\n\n<p>You have to give more information and be more descriptive when you write a prompt. You have to give clear instructions. This will help the model understand what you want. 
If you ask a generic question, you receive a generic answer.<\/p>\n\n\n\n<p>Generic question:<\/p>\n\n\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-full\"><a href=\"https:\/\/www.volucer.it\/wp-content\/uploads\/2024\/04\/Q1.png\"><img loading=\"lazy\" decoding=\"async\" width=\"770\" height=\"373\" src=\"https:\/\/www.volucer.it\/wp-content\/uploads\/2024\/04\/Q1.png\" alt=\"\" class=\"wp-image-763\" srcset=\"https:\/\/www.volucer.it\/wp-content\/uploads\/2024\/04\/Q1.png 770w, https:\/\/www.volucer.it\/wp-content\/uploads\/2024\/04\/Q1-300x145.png 300w, https:\/\/www.volucer.it\/wp-content\/uploads\/2024\/04\/Q1-768x372.png 768w\" sizes=\"auto, (max-width: 770px) 100vw, 770px\" \/><\/a><\/figure>\n<\/div>\n\n\n<p>More specific question:<\/p>\n\n\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-full\"><a href=\"https:\/\/www.volucer.it\/wp-content\/uploads\/2024\/04\/Q2.png\"><img loading=\"lazy\" decoding=\"async\" width=\"763\" height=\"671\" src=\"https:\/\/www.volucer.it\/wp-content\/uploads\/2024\/04\/Q2.png\" alt=\"\" class=\"wp-image-765\" srcset=\"https:\/\/www.volucer.it\/wp-content\/uploads\/2024\/04\/Q2.png 763w, https:\/\/www.volucer.it\/wp-content\/uploads\/2024\/04\/Q2-300x264.png 300w\" sizes=\"auto, (max-width: 763px) 100vw, 763px\" \/><\/a><\/figure>\n<\/div>\n\n\n<p>More contextual and specific question:<\/p>\n\n\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-full is-resized\"><a href=\"https:\/\/www.volucer.it\/wp-content\/uploads\/2024\/04\/Q3.png\"><img loading=\"lazy\" decoding=\"async\" width=\"589\" height=\"778\" src=\"https:\/\/www.volucer.it\/wp-content\/uploads\/2024\/04\/Q3.png\" alt=\"\" class=\"wp-image-766\" style=\"width:767px;height:auto\" srcset=\"https:\/\/www.volucer.it\/wp-content\/uploads\/2024\/04\/Q3.png 589w, https:\/\/www.volucer.it\/wp-content\/uploads\/2024\/04\/Q3-227x300.png 227w\" sizes=\"auto, (max-width: 589px) 100vw, 589px\" 
\/><\/a><\/figure>\n<\/div>\n\n\n<p><\/p>\n\n\n\n<p><\/p>\n\n\n\n<p><strong>OUTPUT FORMATTING<\/strong><\/p>\n\n\n\n<p>If you want ChatGPT to return its output in a specific format, for example CSV (<em>Comma Separated Values<\/em>), Microsoft Excel, Microsoft Word, plain text, or code, you have to specify it in the prompt, as in the following example.<\/p>\n\n\n\n<p>We want statistical data in CSV format:<\/p>\n\n\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-full\"><a href=\"https:\/\/www.volucer.it\/wp-content\/uploads\/2024\/04\/Q4.png\"><img loading=\"lazy\" decoding=\"async\" width=\"841\" height=\"724\" src=\"https:\/\/www.volucer.it\/wp-content\/uploads\/2024\/04\/Q4.png\" alt=\"\" class=\"wp-image-767\" srcset=\"https:\/\/www.volucer.it\/wp-content\/uploads\/2024\/04\/Q4.png 841w, https:\/\/www.volucer.it\/wp-content\/uploads\/2024\/04\/Q4-300x258.png 300w, https:\/\/www.volucer.it\/wp-content\/uploads\/2024\/04\/Q4-768x661.png 768w\" sizes=\"auto, (max-width: 841px) 100vw, 841px\" \/><\/a><\/figure>\n<\/div>\n\n\n<h2 class=\"wp-block-heading has-vivid-cyan-blue-color has-text-color has-link-color wp-elements-6bea8eb8a4a0fe4b14b14a92d7762bf9\"><strong>BIBLIOGRAPHY\/WEBOGRAPHY<\/strong><\/h2>\n\n\n\n<p><strong>[01]<\/strong> openai.com;<\/p>\n\n\n\n<p><strong>[02]<\/strong> KENNETH WARD CHURCH, <em>Emerging Trends Word2Vec<\/em>, IBM 2016;<\/p>\n\n\n\n<p><\/p>\n\n\n\n<p class=\"has-vivid-cyan-blue-color has-text-color has-link-color wp-elements-a300d3269f1d6c34c9146b19a85e0a80\"><strong>Not generated by AI tools or platforms.<\/strong><\/p>\n\n\n\n<p class=\"has-luminous-vivid-orange-color has-text-color has-link-color wp-elements-182971ae0531969180647c9137f01a48\"><strong>{[(homo scripsit)]}<\/strong><\/p>\n","protected":false},"excerpt":{"rendered":"<p>LLMs - Large Language Models LLMs are neural networks that can process and generate natural language text. 
TRAINING PHASE They are trained on a dataset of billions of sentences using unsupervised learning techniques. In the training process LLMs learn what is the most likely word to came next to the previous one based on huge <a class=\"read-more\" href=\"https:\/\/www.volucer.it\/?p=744\">...continue reading <span class=\"meta-nav\">&rarr;<\/span><\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[29,30,32],"tags":[],"class_list":["post-744","post","type-post","status-publish","format-standard","hentry","category-artificial-intelligence","category-massive-multimodal-models","category-natural-language-processing-nlp"],"_links":{"self":[{"href":"https:\/\/www.volucer.it\/index.php?rest_route=\/wp\/v2\/posts\/744","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.volucer.it\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.volucer.it\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.volucer.it\/index.php?rest_route=\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.volucer.it\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=744"}],"version-history":[{"count":22,"href":"https:\/\/www.volucer.it\/index.php?rest_route=\/wp\/v2\/posts\/744\/revisions"}],"predecessor-version":[{"id":773,"href":"https:\/\/www.volucer.it\/index.php?rest_route=\/wp\/v2\/posts\/744\/revisions\/773"}],"wp:attachment":[{"href":"https:\/\/www.volucer.it\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=744"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.volucer.it\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=744"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.volucer.it\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=744"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}