Conta-me Histórias (Tell me stories) is a tool that allows to automatically generate temporal summarization of news collections. through a friendly user interface that enables anyone to explore and revisit events in the past. To select relevant stories and temporal periods, we rely on YAKE!, a key-phrase extraction algorithm developed by our research team, and event detection methods made available by the research community. Additionally, we offer the engine as an open source package that can be extended to support different datasets or languages. The work described here stems from our participation at the Arquivo.pt 2018 competition, where we have been awarded the first prize.
References
Interactive System for Automatically Generating Temporal Narratives [Article Download]
References
YAKE! Keyword Extraction from Single Documents using Multiple Local Features [Article Download]
A Text Feature Based Automatic Keyword Extraction Method for Single Documents [Article Download]
YAKE! Collection-independent Automatic Keyword Extractor [Article Download]
Time-Matters makes use of the Time-Matters python package to score the relevant temporal expressions found within a single text. By doing this, we offer the users the chance to better understand the narrative temporal part of a text. Time-Matters is available through a demo and a Python package.
References
Time-Matters: Temporal Unfolding of Texts [Article Download]
narrArquivo makes use of the Time-Matters python package to score the relevant temporal expressions found within a single text. By doing this, we offer the users the chance to better understand the narrative temporal part of a text. In comparision to Time-Matters we focus on texts collected from the portuguese web archive (Arquivo.pt). narrArquivo is available through a demo and a Python package.
References
Time-Matters: Temporal Unfolding of Texts [Article Download]
Here we provide two user interfaces so that the research community can test the GTE-Cluster and the GTE-Rank temporal search engine applications. In order to retrieve the query results, we rely on the recently launched Bing Search API (5000 transactions/month allowed) parameterized with the en-US market language parameter to retrieve 50 results per query. The proposed solutions are computationally efficient and can easily be tested online. Although the main motivation of our work is focused on queries with temporal nature, the implemented prototypes allow the execution of any query including non-temporal ones. Below is a detailed description of both user interfaces.
References
GTE-Cluster: A Temporal Search Interface for Implicit Temporal Queries [Article Download]
GTE-Rank: Searching for Implicit Temporal Query Results [Article Download]
Below you can find a number of Python packages made available by our research team.
Conta-me Histórias (Tell me stories) is a tool that allows to automatically generate temporal summarization of news collections. through a friendly user interface that enables anyone to explore and revisit events in the past. Conta-me Histórias is available as an open source Python package that can be extended to support different datasets or languages. The work described here stems from our participation at the Arquivo.pt 2018 competition, where we have been awarded the first prize.
References
Interactive System for Automatically Generating Temporal Narratives [Article Download]
References
YAKE! Keyword Extraction from Single Documents using Multiple Local Features [Article Download]
A Text Feature Based Automatic Keyword Extraction Method for Single Documents [Article Download]
YAKE! Collection-independent Automatic Keyword Extractor [Article Download]
KEP is a Python package that enables to extract keyphrases from documents (single or multiple documents) by applying a number of algorithms, the big majority of which provided by pke an open-source package. Differently from PKE, we provide a ready to run code to extract keyphrases not only from a single document, but also in batch mode (i.e., several documents). More to the point, we consider 20 state-of-the-art datasets from which keyphrases may be extracted, and the corresponding dfs and lda pre-computed models (which constrasts with pke as only semeval-2010 models are made available). KEP is available on Dockerhub (ready to run) or available for download (in which case, some configurations need to be done). In any case, we provide a set of jupyter notebooks to ease the process of extracting keyphrases and evaluating the different algorithms.
References
YAKE! Keyword Extraction from Single Documents using Multiple Local Features [Article Download]
A Text Feature Based Automatic Keyword Extraction Method for Single Documents [Article Download]
YAKE! Collection-independent Automatic Keyword Extractor [Article Download]
Time-Matters (winner of the Fraunhofer Portugal Challenge 2013 PhD Contest) is an algorithm that enables to extract relevant dates from a set of documents or multiple docs. Time-Matters is available as an open source Python package and as a docker image.
References
Campos, R., Duque, J., Cândido, T., Mendes, J., Dias, G., Jorge, A., and Nunes, C. (2021). Time-Matters: Temporal Unfolding of Texts. In: .... (eds), Advances in Information Retrieval. ECIR'21 (Lucca, Italy. March 28 - April 1). Lecture Notes in Computer Science, vol ..., pp. x - x. Springer. [Article Download - To appear]
Campos, R., Dias, G., Jorge, A. and Nunes, C. (2017). Identifying Top Relevant Dates for Implicit Time Sensitive Queries. In Information Retrieval Journal. Springer, Vol 20(4), pp 363-398 [Article Download]
Campos, R., Dias, G., Jorge, A. and Nunes, C. (2016). GTE-Rank: a Time-Aware Search Engine to Answer Time-Sensitive Queries. In Information Processing & Management an International Journal. Elsevier, Vol 52(2), pp. 273-298 [Article Download]
Campos, R., Dias, G., Jorge, A., and Nunes, C. (2014). GTE-Cluster: A Temporal Search Interface for Implicit Temporal Queries. In M. de Rijke et al. (Eds.), Lecture Notes in Computer Science - Advances in Information Retrieval - 36th European Conference on Information Retrieval (ECIR2014). Amesterdam, Netherlands, 13 - 16 April. (Vol. 8416-2014, pp. 775 - 779) [Article Download]
Campos, R., Jorge, A., Dias, G. and Nunes, C. (2012). Disambiguating Implicit Temporal Queries by Clustering Top Relevant Dates in Web Snippets. In Proceedings of The 2012 IEEE/WIC/ACM International Joint Conferences on Web Intelligence and Intelligent Agent Technologies Macau, China, 04 - 07 December, Vol. 1, pp 1 - 8. IEEE Computer Society Press. [Article Download]
Time-Matters-Query is a package that enables to extract relevant dates from a set of documents or multiple docs given a query. Time-Matters-Query is available as an open source Python package.
References
Campos, R., Duque, J., Cândido, T., Mendes, J., Dias, G., Jorge, A., and Nunes, C. (2021). Time-Matters: Temporal Unfolding of Texts. In: .... (eds), Advances in Information Retrieval. ECIR'21 (Lucca, Italy. March 28 - April 1). Lecture Notes in Computer Science, vol ..., pp. x - x. Springer. [Article Download - To appear]
Campos, R., Dias, G., Jorge, A. and Nunes, C. (2017). Identifying Top Relevant Dates for Implicit Time Sensitive Queries. In Information Retrieval Journal. Springer, Vol 20(4), pp 363-398 [Article Download]
Campos, R., Dias, G., Jorge, A. and Nunes, C. (2016). GTE-Rank: a Time-Aware Search Engine to Answer Time-Sensitive Queries. In Information Processing & Management an International Journal. Elsevier, Vol 52(2), pp. 273-298 [Article Download]
Campos, R., Dias, G., Jorge, A., and Nunes, C. (2014). GTE-Cluster: A Temporal Search Interface for Implicit Temporal Queries. In M. de Rijke et al. (Eds.), Lecture Notes in Computer Science - Advances in Information Retrieval - 36th European Conference on Information Retrieval (ECIR2014). Amesterdam, Netherlands, 13 - 16 April. (Vol. 8416-2014, pp. 775 - 779) [Article Download]
Campos, R., Jorge, A., Dias, G. and Nunes, C. (2012). Disambiguating Implicit Temporal Queries by Clustering Top Relevant Dates in Web Snippets. In Proceedings of The 2012 IEEE/WIC/ACM International Joint Conferences on Web Intelligence and Intelligent Agent Technologies Macau, China, 04 - 07 December, Vol. 1, pp 1 - 8. IEEE Computer Society Press. [Article Download]
py_heideltime is a python wrapper for the famous Heideltime temporal tagger. Py_heideltime is available as an open source Python package and as a docker image.
References
Strötgen, Gertz: Multilingual and Cross-domain Temporal Tagging. Language Resources and Evaluation, 2013. [Article Download]
py_rule_based is a simple temporal expression detection (mostly year-based) supported by regex rules. Py_rule_based is available as an open source Python package.
Here we make available a number of APIs, so that each software can be easily tested by the research community.
Conta-me Histórias (Tell me stories) is a tool that allows to automatically generate temporal summarization of news collections. through a friendly user interface that enables anyone to explore and revisit events in the past. The work described here stems from our participation at the Arquivo.pt 2018 competition, where we have been awarded the first prize. Conta-me Histórias is available as an API that can be invoked by means of an interface or programatically (through its endoint). In any case, it will always return a JSON file as a result.
References
Interactive System for Automatically Generating Temporal Narratives [Article Download]
References
YAKE! Keyword Extraction from Single Documents using Multiple Local Features [Article Download]
A Text Feature Based Automatic Keyword Extraction Method for Single Documents [Article Download]
YAKE! Collection-independent Automatic Keyword Extractor [Article Download]
Time-Matters (winner of the Fraunhofer Portugal Challenge 2013 PhD Contest) is an algorithm that enables to extract relevant dates from a set of documents or multiple docs. Time-Matters is available as an API that can be invoked by means of an interface or programatically (through its endoint). In any case, it will always return a JSON file as a result.
References
Campos, R., Dias, G., Jorge, A. and Nunes, C. (2017). Identifying Top Relevant Dates for Implicit Time Sensitive Queries. In Information Retrieval Journal. Springer, Vol 20(4), pp 363-398 [Article Download]
Campos, R., Dias, G., Jorge, A. and Nunes, C. (2016). GTE-Rank: a Time-Aware Search Engine to Answer Time-Sensitive Queries. In Information Processing & Management an International Journal. Elsevier, Vol 52(2), pp. 273-298 [Article Download]
Campos, R., Dias, G., Jorge, A., and Nunes, C. (2014). GTE-Cluster: A Temporal Search Interface for Implicit Temporal Queries. In M. de Rijke et al. (Eds.), Lecture Notes in Computer Science - Advances in Information Retrieval - 36th European Conference on Information Retrieval (ECIR2014). Amesterdam, Netherlands, 13 - 16 April. (Vol. 8416-2014, pp. 775 - 779) [Article Download]
Campos, R., Jorge, A., Dias, G. and Nunes, C. (2012). Disambiguating Implicit Temporal Queries by Clustering Top Relevant Dates in Web Snippets. In Proceedings of The 2012 IEEE/WIC/ACM International Joint Conferences on Web Intelligence and Intelligent Agent Technologies Macau, China, 04 - 07 December, Vol. 1, pp 1 - 8. IEEE Computer Society Press. [Article Download]
Below you can find a number of APPs developed by our team.
Conta-me Histórias (Tell me stories) is now available on Google Play
References
Interactive System for Automatically Generating Temporal Narratives [Article Download]
YAKE! is now available on Google Play
References
YAKE! Keyword Extraction from Single Documents using Multiple Local Features [Article Download]
A Text Feature Based Automatic Keyword Extraction Method for Single Documents [Article Download]
YAKE! Collection-independent Automatic Keyword Extractor [Article Download]
Campos, R., Dias, G., Jorge, A. and Nunes, C. (2012). GTE: A Distributional Second-Order Co-Occurrence Approach to Improve the Identification of Top Relevant Dates in Web Snippets. In Proceedings of the 21st ACM International Conference on Information and Knowledge Management (CIKM 2012) Maui, Hawaii, October 29 - November 02, ISBN 978-1-4503-1156-4, pp 2035 - 2039. ACM Press
Campos, R., Jorge, A. and Dias, G. (2011). Using Web Snippets and Query-logs to Measure Implicit Temporal Intents in Queries. In Proceedings of the Query Representation and Understanding Workshop (QRU 2011) associated to 34th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR2011) Beijing, China, 28 July, pp 13 - 16.
Campos, R., Dias, G. & Jorge, A. (2011). What is the Temporal Value of Web Snippets? In Proceedings of the 1st International Temporal Web Analytics Workshop (TWAW2011) associated to the 20th International World Wide Web Conference (WWW2011), pp 9 – 16, Hyderabad, India, 28th March, ISSN 1613 - 0073.
Campos, R., Dias, G. & Jorge, A. (2011). An Exploratory Study on the impact of Temporal Features on the Classification and Clustering of Future-Related Web Documents. In L. Antunes and H.S. Pinto (Eds.), Lecture Notes in Artificial Intelligence - Progress in Artificial Intelligence, - 15th Portuguese Conference on Artificial Intelligence (EPIA2011) associated to APPIA: Portuguese Association for Artificial Intelligence Lisbon, Portugal, 10 - 13 October. (Vol. 7026-2011, pp. 581 - 596). ISBN: 978-3-642-24768-2. DBLP. Springer. Thomson ISI Web of Knowledge. ACM Press.
Campos, R., Dias, G. & Jorge, A. (2011). What is the Temporal Value of Web Snippets? In Proceedings of the 1st International Temporal Web Analytics Workshop (TWAW2011) associated to the 20th International World Wide Web Conference (WWW2011), pp 9 – 16, Hyderabad, India, 28th March, ISSN 1613 - 0073.
[GTE-Rank Crowdsourcing Experiment Webpage]
Campos, R., Dias, G., Jorge, A. and Nunes, C. (2016). GTE-Rank: a Time-Aware Search Engine to Answer Time-Sensitive Queries. In Information Processing & Management an International Journal. Elsevier, Vol 52(2), pp 273-298, ISSN 0306-4573.
[GTE-Rank Temporal Re-Ranking Experiment Webpage]
Campos, R., Dias, G., Jorge, A. and Nunes, C. (2016). GTE-Rank: a Time-Aware Search Engine to Answer Time-Sensitive Queries. In Information Processing & Management an International Journal. Elsevier, Vol 52(2), pp 273-298, ISSN 0306-4573.
[GTE-Cluster Flat Temporal Clustering Experiment Webpage]
Campos, R., Dias, G., Jorge, A. and Nunes, C. (2012). Disambiguating Implicit Temporal Queries by Clustering Top Relevant Dates in Web Snippets In Proceedings da IEEE Main Conference Proceedings of the 2012 IEEE/WIC/ACM International Conference on Web Intelligence, Macau, China, December 04 – 07.
[WC_DS vs. QLog_DS Experiment Webpage]
Campos, R., Dias, G., Jorge, A. and Nunes, C. (2012). Enriching Temporal Query Understanding through Date Identification: How to Tag Implicit Temporal Queries? In Proceedings of the 2nd International Temporal Web Analytics Workshop (TWAW 2012) associated to 21th International World Wide Web Conference (WWW2012) Lyon, France, 17 April. ISBN 978-1-4503-1188-5, pp 41 – 48. ACM Press.
Campos, R., Dias, G., Jorge, A. and Nunes, C. (2012). GTE: A Distributional Second-Order Co-Occurrence Approach to Improve the Identification of Top Relevant Dates in Web Snippets. In Proceedings of the 21st ACM International Conference on Information and Knowledge Management (CIKM 2012), Maui, Hawaii, October 29 - November 02, ISBN 978-1-4503-1156-4, pp 2035 - 2039. ACM Press.
[Future Temporal Data Experiment Webpage]
Campos, R., Dias, G. & Jorge, A. (2011). An Exploratory Study on the impact of Temporal Features on the Classification and Clustering of Future-Related Web Documents. In L. Antunes and H.S. Pinto (Eds.), Lecture Notes in Artificial Intelligence - Progress in Artificial Intelligence, - 15th Portuguese Conference on Artificial Intelligence (EPIA2011) associated to APPIA: Portuguese Association for Artificial Intelligence Lisbon, Portugal, 10 - 13 October. (Vol. 7026-2011, pp. 581 - 596). ISBN: 978-3-642-24768-2. DBLP. Springer. Thomson ISI Web of Knowledge. ACM Press.
[Temporal Query Classification Experiment Webpage]
Campos, R., Dias, G. & Jorge, A. (2011). What is the Temporal Value of Web Snippets? In Proceedings of the 1st International Temporal Web Analytics Workshop (TWAW2011) associated to the 20th International World Wide Web Conference (WWW2011), pp 9 – 16, Hyderabad, India, 28th March, ISSN 1613 - 0073.
Campos, R., Dias, G. & Jorge, A. (2011). What is the Temporal Value of Web Snippets? In Proceedings of the 1st International Temporal Web Analytics Workshop (TWAW2011) associated to the 20th International World Wide Web Conference (WWW2011), pp 9 – 16, Hyderabad, India, 28th March, ISSN 1613 - 0073.