The correlation dimension of natural language is measured by applying the Grassberger-Procaccia algorithm to high-dimensional sequences produced by a large-scale language model. This method, previously…
scaling law
4 Articles
4
Various metrics are considered in terms of whether they characterize different kinds of data. For example, in the case of natural language, metrics that specify…
A generative model is a mathematical formulation that generates a sample similar to real data. Many such models have been proposed using machine learning methods,…
For mathematical models of language, their potential, limitations, and ways of improvement are investigated in terms of whether they reproduce the complex properties of language….