Ap on UBLabs

https://labs.ub.uni-frankfurt.de/authors/ap/
Recent content in Ap on UBLabs
Hugo, en
labs@ub.uni-frankfurt.de (UBLabs)
Universitätsbibliothek Frankfurt am Main
Thu, 17 Aug 2023 00:00:01 +0200

Data Engineering with luigi - Lessons learned
https://labs.ub.uni-frankfurt.de/post/data-engineering-with-luigi-lessons-learned/
Thu, 17 Aug 2023 00:00:01 +0200, labs@ub.uni-frankfurt.de (UBLabs)
Introduction
At the UB JCS, we make extensive use of the Python luigi framework for data engineering. The framework can handle thousands of tasks, resolve non-circular task dependencies, and run for days. Additionally, it provides a convenient web control panel to, e.g., view the task dependencies as a tree diagram or start specific tasks. Although luigi already supports the user by enforcing a very specific structure, there are still some things to consider when designing a data pipeline with luigi (for a general introduction, see a previous post).

Common engineering strategies in luigi
https://labs.ub.uni-frankfurt.de/post/common-engineering-strategies-in-luigi/
Thu, 29 Jun 2023 14:00:00 +0200, labs@ub.uni-frankfurt.de (UBLabs)
Introduction
For many automated data processing tasks within the context of the Specialised Information Services (FID) at the University Library Frankfurt, we use the Python package luigi. This package proves especially useful when a task (e.g. loading data into a database) depends on other tasks that have to finish successfully before it can start (e.g. the data has to be downloaded first). luigi works out all required tasks, together with the tasks they require in turn, and then processes everything for you.
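The dependency handling described in the two luigi summaries above can be illustrated without luigi itself. The following is a minimal sketch that uses only the Python standard library (`graphlib`, Python 3.9+) to order tasks so that every requirement runs first and to detect circular dependencies. It is not luigi's implementation or API; the task names and the helper functions `execution_order` and `has_cycle` are hypothetical and chosen only to mirror the "download first, then load into the database" example.

```python
from graphlib import CycleError, TopologicalSorter

# Hypothetical task names; each task maps to the set of tasks it requires.
REQUIRES = {
    "download_data": set(),
    "clean_data": {"download_data"},
    "load_into_database": {"clean_data"},
}


def execution_order(requires):
    """Return the tasks in an order where every requirement comes first."""
    return list(TopologicalSorter(requires).static_order())


def has_cycle(requires):
    """Return True if the requirements contain a circular dependency."""
    try:
        execution_order(requires)
    except CycleError:
        return True
    return False


if __name__ == "__main__":
    # For this linear chain the order is unique.
    print(execution_order(REQUIRES))
    # Two tasks that require each other can never be scheduled.
    print(has_cycle({"a": {"b"}, "b": {"a"}}))
```

In luigi, the same information is declared per task (each task states what it requires), and the scheduler derives the execution order from those declarations.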
Upgrading OJS with the ojs_updater
https://labs.ub.uni-frankfurt.de/post/upgrading-ojs-with-the-ojs-updater/
Wed, 21 Dec 2022 12:07:00 +0000, labs@ub.uni-frankfurt.de (UBLabs)
Introduction
At the University Library Frankfurt, we currently host 21 OJS journals, with more to come. Since we run only a single journal per OJS instance, we have to maintain 21 different OJS instances. To maintain and manage this multiplicity, we found it important to establish structures on the server and to build helper tools. Updating a journal instance in particular can be quite tedious, since it involves multiple manual steps, and forgetting one of them can cause problems.

Maintaining applications with external API dependencies with software tests
https://labs.ub.uni-frankfurt.de/post/maintaining-applications-with-external-api-dependencies-with-software-tests/
Sun, 31 Jul 2022 00:00:00 +0000, labs@ub.uni-frankfurt.de (UBLabs)
Introduction
When talking about software maintainability, we always imply that a given piece of software can be extended or changed (even by someone other than the original maintainer) without breaking the code or introducing bugs. This premise assumes that there is a good test suite making sure that the software still works as intended after the change (or with a new version of an external dependency). For example, the complexity of the software in the BIOfid software framework, both in the backend and the frontend, can only be tamed by applying tests that make sure that the code works as intended - now and in the future.
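One common way to test code that depends on an external API is to replace the network call with a canned response, so the test checks your own parsing logic independently of the remote service. The sketch below shows this with the standard library's `unittest.mock`. It is an assumption for illustration, not the approach taken in the post summarized above: the endpoint `API_URL`, the function `fetch_taxon_name`, and the response format are all hypothetical.

```python
import json
from unittest import mock
from urllib import request

# Hypothetical endpoint, used only for illustration.
API_URL = "https://api.example.org/taxa"


def fetch_taxon_name(taxon_id):
    """Ask the (hypothetical) external API for the name of a taxon."""
    with request.urlopen(f"{API_URL}/{taxon_id}") as response:
        payload = json.load(response)
    return payload["name"]


def test_fetch_taxon_name():
    """Check the parsing against a canned response, without network access."""
    fake_response = mock.MagicMock()
    # urlopen() is used as a context manager, so __enter__ must return the fake.
    fake_response.__enter__.return_value = fake_response
    fake_response.read.return_value = b'{"name": "Fagus sylvatica"}'
    with mock.patch("urllib.request.urlopen", return_value=fake_response) as fake_urlopen:
        assert fetch_taxon_name(42) == "Fagus sylvatica"
    # The function must have requested exactly the expected URL.
    fake_urlopen.assert_called_once_with(f"{API_URL}/42")


if __name__ == "__main__":
    test_fetch_taxon_name()
    print("ok")
```

A mocked test like this only guards your own code. Detecting a changed external API additionally requires tests that run against the real service.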
Getting the (Semantic) Sense out of a User Query
https://labs.ub.uni-frankfurt.de/post/getting-the-semantic-sense-out-of-a-user-query/
Wed, 15 Sep 2021 08:00:00 +0000, labs@ub.uni-frankfurt.de (UBLabs)
The BIOfid Semantic Search
Within the BIOfid project, we are creating a semantic search portal (hereafter “BIOfid portal”) to help our users access legacy biodiversity literature more easily. Because the BIOfid portal has a deeper “understanding” of both the texts and the species they include, it lets users find more relevant documents. Moreover, the BIOfid portal interprets the user query and transforms it ad hoc into a graph database query to learn more about the user's intention.