Complexity of Language and Its Relation to Inference

Documents have complexity from various perspectives, such as compression rate and the degree of fluctuation. The complexity varies depending on the extent to which the document is based on “inference.” For example, a corpus of mathematical proofs has a higher compression rate than literary works. Even within natural language documents, those based on inference, such as legal documents, have properties similar to mathematical proofs. We are investigating the relationship between the degree of inference and complexity, considering the language models necessary for legal documents and software engineering.

Categorized in:

Inference, Reasoning Language

Tagged in:

complexity metrics, compression rate

Complexity of Language and Its Relation to Inference

Leave a Reply Cancel reply

Other Stories

AAAI 2025. Information-Theoretic Generative Clustering of Documents

ACL 2020. Stock Embeddings Acquired from News Articles and Price History, and an Application to Portfolio Optimization

NeurIPS 2025. Correlation Dimension of Autoregressive Large Language Models

TSD 2025. Scale-free Characteristics of Multilingual Legal Texts and the Limitations of LLMs

🏆ACL 2025 Outstanding Paper Award. New Formulation of Zipf’s Meaning-Frequency Law

🏆ACL 2025 Outstanding Paper Award. New Formulation of Zipf’s Meaning-Frequency Law

AAAI 2025. Information-Theoretic Generative Clustering of Documents

JSTAT 2023. Strahler number of natural language sentences in comparison with random trees

Physical Review Research 2024. Correlation dimension of natural language in a statistical manifold

Knowledge-Based Systems 2022. Modeling of financial markets under extreme risks

NeurIPS 2025. Correlation Dimension of Autoregressive Large Language Models

🏆ACL 2025 Outstanding Paper Award. New Formulation of Zipf’s Meaning-Frequency Law

AAAI 2025. Information-Theoretic Generative Clustering of Documents

ACL 2020. Stock Embeddings Acquired from News Articles and Price History, and an Application to Portfolio Optimization

JSTAT 2023. Strahler number of natural language sentences in comparison with random trees

ACL 2020. Stock Embeddings Acquired from News Articles and Price History, and an Application to Portfolio Optimization

ACM ICAIF 2023. Co-Training Realized Volatility Prediction Model with Neural Distributional Transformation

ACL 2020. Influence of textual data and communication structure on financial prices

Knowledge-Based Systems 2022. Modeling of financial markets under extreme risks

Press ESC to close

Or check our Popular Categories...

Leave a Reply Cancel reply

Related Articles

Other Stories

AAAI 2025. Information-Theoretic Generative Clustering of Documents

ACL 2020. Stock Embeddings Acquired from News Articles and Price History, and an Application to Portfolio Optimization