Inverted index techniques in music information retrieval: three application scenarios
The rapid advancement of digital technology has resulted in the widespread use of search engines and recommender systems across several industries. In music streaming platforms, these systems are transforming the way we discover and experience music. However, the growing number of users and items av...
| Autor: | |
|---|---|
| Tipo de recurso: | tesis doctoral |
| Estado: | Versión publicada |
| Fecha de publicación: | 2023 |
| País: | Brasil |
| Institución: | Universidade de São Paulo (USP) |
| Repositorio: | Biblioteca Digital de Teses e Dissertações da USP |
| Idioma: | inglés |
| OAI Identifier: | oai:teses.usp.br:tde-07102024-212635 |
| Acceso en línea: | https://www.teses.usp.br/teses/disponiveis/45/45134/tde-07102024-212635/ |
| Access Level: | acceso abierto |
| Palabra clave: | Audio identification Content-based retrieval Identificação de áudio Identificação de versão Indexação Indexing Information retrieval Music information retrieval Ordenação por relevância Recommender systems Recuperação de informação Recuperação de informação musical Scoring techniques Sistemas de recomendação Version identification |
| Sumario: | The rapid advancement of digital technology has resulted in the widespread use of search engines and recommender systems across several industries. In music streaming platforms, these systems are transforming the way we discover and experience music. However, the growing number of users and items available in catalogues represents a significant challenge for these platforms. The Information Retrieval (IR) research field has a long-standing history of scale-related issues, initially driven by text search needs. Over time, this area has evolved to address a wide range of retrieval requirements, offering a range of techniques that can be employed for solving problems in many contexts. Indexing and searching are two complementary activities that form the foundation of IR systems. These systems are usually supported by the inverted index, a central data structure that enables the efficient indexing and retrieval of relevant information. Designing IR systems for non-textual scenarios is not a trivial task. For instance, the accurate scoring and ranking of the retrieved results requires representing the problem as a collection of indexable units. In the field of Music Information Retrieval (MIR), numerous tasks rely on non-textual representations (such as digital audio and music data), presenting distinct challenges for efficient retrieval. This study explores the application of IR techniques for music retrieval through the investigation and assessment of three distinct music tasks where scaling is an important requirement: audio identification, version identification, and music recommendation. The study aims to develop a better understanding of the challenges involved in designing effective IR systems for music purposes, with a particular focus on understanding the effectiveness of typical IR scoring techniques in these tasks. The findings of this study suggest that typical text-based scoring techniques behave as competitive baselines for relevance-based ranking, enabling researchers to focus on representation aspects. Also, the study highlights the importance of evaluating the number of results returned by retrieval systems along with the performance. Also, the study highlights the impact of the number of retrieved items in scalability and efficiency. |
|---|