State-of-the-art word embedding methods represent a word with a single vector and presume a linear vector space,…
Mathematical Modeling of Natural Language and Machine Learning
We study mathematical properties of natural language by using the theory of complex systems. The relation between linguistic structure (such as words and grammar) and large-scale properties is investigated from the perspectives of fractals and chaos. Mathematical models of language that reproduce such understanding are explored and are applied to natural language processing. Language has properties in common with other large-scale social systems, such as finance and communication networks. We explore new ways of engineering these social systems by using texts and computing with large-scale resources.
Â
Â
Natural Language as a complex system
Computional linguistics and study of natural language by using the theory of complex systems
- Statistical properties of language
- Methods for measuring nonstationarity and long memory underlying language
- Scaling properties of language
- Quantification of complexity of language
Mathematical language models
Mathematical language models and computational representations of language structure
- Language models that reproduce statistical properties of language
- Embeddings that encode scaling properties
- Mathematical relation between long memory and grammatical structure
- Deep learning methods for sequences with complex properties
Social complex systems via texts
Analysis and prediction of social complex systems via texts
- Embedding methods for social complex system entities
- Deep learning methods for financial time series by using texts
- Analyses of various social complex systems from linguistic perspectives