{"id":1188,"date":"2024-12-30T15:43:42","date_gmt":"2024-12-30T14:43:42","guid":{"rendered":"https:\/\/dhlunch.ijp.pan.pl\/?page_id=1188"},"modified":"2025-01-03T19:35:34","modified_gmt":"2025-01-03T18:35:34","slug":"10-01-2025","status":"publish","type":"page","link":"https:\/\/dhlunch.ijppan.pl\/en\/10-01-2025\/","title":{"rendered":"10th January 2025"},"content":{"rendered":"<p><strong>Mandelbrot-Zipf-R\u00e9nyi law<\/strong><\/p>\n<p>Marek Czachor (Politechnika Gda\u0144ska)<\/p>\n<p>If you take any sufficiently long text (or corpus of texts) and count the number of occurrences of each individual word, you will observe an interesting regularity: if dealing with natural language, the graph presenting the arrangement of words in descending frequency order will always take the same shape. This is the so-called Zipf distribution (Zipf, 1935).<br \/>\nThe following three charts show typical corpus data (Shakespeare and Dickens; top charts) and the binding time of carbon monoxide to myoglobin, the protein responsible for the functioning of our muscles (for different temperatures; bottom chart).<br \/>\n<img fetchpriority=\"high\" decoding=\"async\" src=\"https:\/\/dhlunch.ijppan.pl\/wp-content\/uploads\/2024\/12\/3-300x244.png\" alt=\"\" width=\"300\" height=\"244\" class=\"alignnone size-medium wp-image-1191\" srcset=\"https:\/\/dhlunch.ijppan.pl\/wp-content\/uploads\/2024\/12\/3-300x244.png 300w, https:\/\/dhlunch.ijppan.pl\/wp-content\/uploads\/2024\/12\/3.png 679w\" sizes=\"(max-width: 300px) 100vw, 300px\" \/><\/p>\n<p>The similarity is obvious, which suggests the existence of a general statistical principle beyond linguistics or molecular biology. In each of the above graphs, three areas can be distinguished: left (horizontal; first bend down), middle (a fragment of a straight line) and right (second bend down). The middle area is described by classic Zipf&#8217;s law (Zipf, 1935). The first and second areas are jointly described by Zipf-Mandelbrot&#8217;s law (Mandelbrot, 1965). We are interested in the third area, or rather the law which would cover all three areas, because the commonly used formulas do not explain why the line of the graph &#8220;collapses&#8221; at the bottom. What is more, we do not want to simply guess a certain mathematical formula, but to derive it from general principles.<\/p>\n<p>It turns out (Czachor-Naudts, 2002) that the &#8220;first cause&#8221; may be one of the basic principles of thermodynamics, namely the process of achieving the so-called thermodynamic equilibrium \u2013 a phenomenon known to us from everyday life as the cooling down of unfinished coffee. In the case of Zipf&#8217;s law, the &#8220;trick&#8221; consists of properly defining the mean value, which ultimately leads to the entropy of R\u00e9nyi (R\u00e9nyi, 1960). The concept of entropy plays a key role both in the theory of thermodynamics (Clausius, 1865) and in the theory of information (Shannon, 1948). In both of these theories, it refers to the level of uncertainty or dispersion of the system.<\/p>\n<p>Thus, one can speak of the eponymous Mandelbrot-Zipf-R\u00e9nyi law, which unifies all three data areas (left: Mandelbrot, middle: Zipf, right: R\u00e9nyi). This law (with no focus on the mathematical details) will be the subject of our meeting.<\/p>\n<p>The meeting will take hybrid form. To participate online please sign up here: <a href=\"https:\/\/forms.gle\/4K1MJ7V9JW8MDKmq7\">https:\/\/forms.gle\/4K1MJ7V9JW8MDKmq7<\/a> Attention, this time the meeting is at 12.00!<\/p>","protected":false},"excerpt":{"rendered":"<p>Mandelbrot-Zipf-R\u00e9nyi law Marek Czachor (Politechnika Gda\u0144ska) If you take any sufficiently long text (or corpus of texts) and count the number of occurrences of each individual word, you will observe &hellip; <a href=\"https:\/\/dhlunch.ijppan.pl\/en\/10-01-2025\/\" class=\"more-link\">Continue reading <span class=\"screen-reader-text\">10th January 2025<\/span><\/a><\/p>\n","protected":false},"author":3,"featured_media":0,"parent":0,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"","meta":{"footnotes":""},"class_list":["post-1188","page","type-page","status-publish","hentry"],"_links":{"self":[{"href":"https:\/\/dhlunch.ijppan.pl\/en\/wp-json\/wp\/v2\/pages\/1188","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/dhlunch.ijppan.pl\/en\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/dhlunch.ijppan.pl\/en\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/dhlunch.ijppan.pl\/en\/wp-json\/wp\/v2\/users\/3"}],"replies":[{"embeddable":true,"href":"https:\/\/dhlunch.ijppan.pl\/en\/wp-json\/wp\/v2\/comments?post=1188"}],"version-history":[{"count":2,"href":"https:\/\/dhlunch.ijppan.pl\/en\/wp-json\/wp\/v2\/pages\/1188\/revisions"}],"predecessor-version":[{"id":1202,"href":"https:\/\/dhlunch.ijppan.pl\/en\/wp-json\/wp\/v2\/pages\/1188\/revisions\/1202"}],"wp:attachment":[{"href":"https:\/\/dhlunch.ijppan.pl\/en\/wp-json\/wp\/v2\/media?parent=1188"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}