Speech [ABELAB]

Speech

During the last decade, thanks to developments of probabilistic algorithms, large speech databases and high performance computer hardware, speech processing technologies such as speech recognition and speech synthesis have been dramatically improved. We can obtain considerable benefits from the technologies including car navigation and speech retrieval through smartphones. However, the performance is still far from those of human beings and users are not completely satisfied with them. To challenge the problem, we try to focus on higher level behaviors of human beings like emotion and speech acts.

Life log

Recently, the activities of our daily lives could be recorded electronically from various information sources. This is referred to as a life log, which is a massive electronic database of every activity, including cyberspace activities (e.g., web sites visited, keywords for searching, and e-mail content) and real-world activities (e.g., photos, videos, physical locations recorded via wearable GPS, and body movements captured by acceleration sensors). Our research goal is to develop user behavior models using the life log and adaptively optimize functionalities of services and systems for individual users.

Human interface

In big data eras, data mining algorithms play important roles to extract useful information from diverse and huge data. However, algorithms are not the only technology to utilize big data in depth. We believe one of the important technology is human interface that enables users to intuitively understand meaning of analyzed results in trial and error style. Currently, we are trying to visualize synchronously recorded data in life logs to support human memory and to visualize environmental sounds to understand urban life styles.