TACL. Understanding Benchmark Language Under Weakened Formal Semantics

May 21, 2026

1 Min Read

State-of-the-art NLP benchmarks require interpretation of natural language that specifies conditions, procedures, and exceptions, often relying on implicit assumptions and external knowledge. Constructing complete semantic representations with proof-theoretic guarantees is…

Inference, Reasoning Language

lsci

ACL 2026. Repeated Sequences Reveal Gaps between Large Language Models and Natural Language

May 21, 2026

1 Min Read

0 81

Evaluating whether large language models (LLMs) capture the structure of natural language beyond local fluency remains an open challenge. Existing evaluation methods, largely based on task performance or short-context behavior,…

Inference, Reasoning Language

lsci

DH 2026. Retrieval-Augmented Description Generation for Ceramic Artworks: Effectiveness of Knowledge-Enhancement by the Museum Metadata

May 21, 2026

1 Min Read

0 69

Large language models (LLMs) such as ChatGPT are increasingly used in the cultural heritage domain for tasks like metadata creation, semantic enrichment, and artwork captioning. Since these tasks depend on…

Language Machine learning

lsci

ICML 2026. Escaping Mode Collapse in LLM Generation via Geometric Regulation

May 21, 2026

1 Min Read

0 66

Mode collapse is a persistent challenge in generative modeling and appears in autoregressive text generation as behaviors ranging from explicit looping to gradual loss of diversity and premature trajectory convergence…

Language Machine learning

lsci

NeurIPS 2025. Correlation Dimension of Autoregressive Large Language Models

November 18, 2025

1 Min Read

0 188

Large language models (LLMs) have achieved remarkable progress in natural language generation, yet they continue to display puzzling behaviors, such as repetition and incoherence, even when exhibiting low perplexity. This highlights a key limitation…

Language Machine learning

lsci

🏆ACL 2025 Outstanding Paper Award. New Formulation of Zipf’s Meaning-Frequency Law

August 10, 2025

1 Min Read

0 279

This paper proposes formulating Zipf’s meaning-frequency law, the power law between word frequency and the number of meanings, as a relationship between word frequency and contextual diversity. The proposed formulation…

Featured Language

lsci

AAAI 2025. Information-Theoretic Generative Clustering of Documents

April 1, 2025

1 Min Read

0 265

Clustering is a fundamental technique in machine learning and data mining, offering a powerful lens to understand self-organizing patterns in the real world. At its core, clustering is inherently information-theoretic:…

Featured Language

lsci

Complexity of Language and Its Relation to Inference

June 28, 2024

1 Min Read

0 253

Documents have complexity from various perspectives, such as compression rate and the degree of fluctuation. The complexity varies depending on the extent to which the document is based on “inference.”…

Inference, Reasoning Language

lsci

ACL 2020. Stock Embeddings Acquired from News Articles and Price History, and an Application to Portfolio Optimization

June 28, 2024

1 Min Read

0 271

We proposed a stock vector representation called “stock embedding,” obtained using a deep learning framework that utilizes news articles and stock price history. This embedding is applicable to financial problems…

Finance Language

lsci

JSTAT 2023. Strahler number of natural language sentences in comparison with random trees

May 10, 2024

1 Min Read

0 255

The Strahler number was originally proposed to characterize the complexity of river bifurcation and has found various applications. This article proposes computation of the Strahler number’s upper and lower limits…

Complex System Featured

lsci

Physical Review Research 2024. Correlation dimension of natural language in a statistical manifold

Knowledge-Based Systems 2022. Modeling of financial markets under extreme risks

ACM ICAIF 2023. Co-Training Realized Volatility Prediction Model with Neural Distributional Transformation

ACL 2020. Influence of textual data and communication structure on financial prices
