TACL. Understanding Benchmark Language Under Weakened Formal Semantics

May 21, 2026

1 Min Read

State-of-the-art NLP benchmarks require interpretation of natural language that specifies conditions, procedures, and exceptions, often relying on implicit assumptions and external knowledge. Constructing complete semantic representations with proof-theoretic guarantees is…

Inference, Reasoning Language

lsci

ACL 2026. Repeated Sequences Reveal Gaps between Large Language Models and Natural Language

May 21, 2026

1 Min Read

Evaluating whether large language models (LLMs) capture the structure of natural language beyond local fluency remains an open challenge. Existing evaluation methods, largely based on task performance or short-context behavior,…

Inference, Reasoning Language

lsci

DH 2026. Retrieval-Augmented Description Generation for Ceramic Artworks: Effectiveness of Knowledge-Enhancement by the Museum Metadata

May 21, 2026

1 Min Read

Large language models (LLMs) such as ChatGPT are increasingly used in the cultural heritage domain for tasks like metadata creation, semantic enrichment, and artwork captioning. Since these tasks depend on…

Language Machine learning

lsci

ICML 2026. Escaping Mode Collapse in LLM Generation via Geometric Regulation

May 21, 2026

1 Min Read

Mode collapse is a persistent challenge in generative modeling and appears in autoregressive text generation as behaviors ranging from explicit looping to gradual loss of diversity and premature trajectory convergence….

Language Machine learning

lsci

NeurIPS 2025. Correlation Dimension of Autoregressive Large Language Models

November 18, 2025

1 Min Read

Large language models (LLMs) have achieved remarkable progress in natural language generation, yet they continue to display puzzling behaviors, such as repetition and incoherence, even when exhibiting low perplexity. This highlights a key limitation…

Language Machine learning

lsci

TSD 2025. Scale-free Characteristics of Multilingual Legal Texts and the Limitations of LLMs

October 4, 2025

1 Min Read

This work presents a comparative analysis of text complexity across domains using scale-free metrics. We quantify linguistic complexity via Heaps’ exponent β (vocabulary growth), Taylor’s exponent α (word-frequency fluctuation scaling),…

Uncategorized

lsci

🏆ACL 2025 Outstanding Paper Award. New Formulation of Zipf’s Meaning-Frequency Law

August 10, 2025

1 Min Read

This paper proposes formulating Zipf’s meaning-frequency law, the power law between word frequency and the number of meanings, as a relationship between word frequency and contextual diversity. The proposed formulation…

Featured Language

lsci

AAAI 2025. Information-Theoretic Generative Clustering of Documents

April 1, 2025

1 Min Read

Clustering is a fundamental technique in machine learning and data mining, offering a powerful lens to understand self-organizing patterns in the real world. At its core, clustering is inherently information-theoretic:…

Featured Language

lsci

Complexity of Language and Its Relation to Inference

June 28, 2024

1 Min Read

Documents have complexity from various perspectives, such as compression rate and the degree of fluctuation. The complexity varies depending on the extent to which the document is based on “inference.”…

Inference, Reasoning Language

lsci

ACL 2020. Stock Embeddings Acquired from News Articles and Price History, and an Application to Portfolio Optimization

June 28, 2024

1 Min Read

We proposed a stock vector representation called “stock embedding,” obtained using a deep learning framework that utilizes news articles and stock price history. This embedding is applicable to financial problems…

Finance Language

lsci

JSTAT 2023. Strahler number of natural language sentences in comparison with random trees

Physical Review Research 2024. Correlation dimension of natural language in a statistical manifold

Knowledge-Based Systems 2022. Modeling of financial markets under extreme risks

ACM ICAIF 2023. Co-Training Realized Volatility Prediction Model with Neural Distributional Transformation

ACL 2020. Influence of textual data and communication structure on financial prices
