TACL. Understanding Benchmark Language Under Weakened Formal Semantics

May 21, 2026

1 Min Read

State-of-the-art NLP benchmarks require interpretation of natural language that specifies conditions, procedures, and exceptions, often relying on implicit assumptions and external knowledge. Constructing complete semantic representations with proof-theoretic guarantees is…

Inference, Reasoning Language

lsci

ACL 2026. Repeated Sequences Reveal Gaps between Large Language Models and Natural Language

May 21, 2026

1 Min Read

0 68

Evaluating whether large language models (LLMs) capture the structure of natural language beyond local fluency remains an open challenge. Existing evaluation methods, largely based on task performance or short-context behavior,…

Inference, Reasoning Language

lsci

DH 2026. Retrieval-Augmented Description Generation for Ceramic Artworks— Effectiveness of Knowledge-Enhancement by the Museum Metadata—

May 21, 2026

1 Min Read

0 67

Large language models (LLMs) such as ChatGPT are increasingly used in the cultural heritage domain for tasks like metadata creation, semantic enrichment, and artwork captioning. Since these tasks depend on…

Language Machine learning

lsci

ICML 2026. Escaping Mode Collapse in LLM Generation via Geometric Regulation

May 21, 2026

1 Min Read

0 63

Mode collapse is a persistent challenge in generative modeling and appears in autoregressive text generation as behaviors ranging from explicit looping to gradual loss of diversity and premature trajectory convergence….

Language Machine learning

lsci

NeurIPS 2025. Correlation Dimension of Autoregressive Large Language Models

November 18, 2025

1 Min Read

0 186

Large language models (LLMs) have achieved remarkable progress in natural language generation, yet they continue to display puzzling behaviors, such as repetition and incoherence, even when exhibiting low perplexity. This highlights a key limitation…

Language Machine learning

lsci

🏆ACL 2025 Outstanding Paper Award. New Formulation of Zipf’s Meaning-Frequency Law

August 10, 2025

1 Min Read

0 279

This paper proposes formulating Zipf’s meaning-frequency law, the power law between word frequency and the number of meanings, as a relationship between word frequency and contextual diversity. The proposed formulation…

Featured Language

lsci

AAAI 2025. Information-Theoretic Generative Clustering of Documents

April 1, 2025

1 Min Read

0 262

Clustering is a fundamental technique in machine learning and data mining, offering a powerful lens to understand self-organizing patterns in the real world. At its core, clustering is inherently information-theoretic:…

Featured Language

lsci

ACM ICAIF 2023. Co-Training Realized Volatility Prediction Model with Neural Distributional Transformation

May 9, 2024

1 Min Read

0 219

This paper shows a novel machine learning model for realized volatility (RV) prediction using a normalizing flow, an invertible neural network. Since RV is known to be skewed and have a…

Finance Machine learning

lsci

ICML 2024. Bottleneck-minimal indexing for generative document retrieval

May 9, 2024

1 Min Read

0 355

We apply an information-theoretic perspective to reconsider generative document retrieval (GDR), in which a document x∈X is indexed by t∈T, and a neural autoregressive model is trained to map queries Q to T. GDR…

Language Machine learning

lsci

Natural Language Engineering 2018. Unsupervised extraction of templates from texts

August 10, 2023

1 Min Read

0 263

Templates are multi-word expressions with slots, such as “Starting at _ on _ ” or “regard _ as _”, that appear frequently in text and also in data from sources…

Language Machine learning

lsci

JSTAT 2023. Strahler number of natural language sentences in comparison with random trees

Physical Review Research 2024. Correlation dimension of natural language in a statistical manifold

Knowledge-Based Systems 2022. Modeling of financial markets under extreme risks

ACL 2020. Stock Embeddings Acquired from News Articles and Price History, and an Application to Portfolio Optimization

ACL 2020. Influence of textual data and communication structure on financial prices
