Die chronologische Liste zeigt aktuelle Veröffentlichungen aus dem Forschungsbetrieb der Hochschule Weihenstephan-Triesdorf. Zuständig ist das Zentrum für Forschung und Wissenstransfer (ZFW).
One of the major challenges in time series analysis are changing data distributions, especially when processing data streams. To ensure an up-to-date model delivering useful predictions at all times, model reconfigurations are required to adapt to such evolving streams. For Gaussian processes, this might require the adaptation of the internal kernel expression. In this paper, we present dynamically self-adjusting Gaussian processes by introducing Event Triggered Kernel Adjustments in Gaussian process modelling (ETKA), a novel data stream modelling algorithm that can handle evolving and changing data distributions. To this end, we enhance the recently introduced Adjusting Kernel Search with a novel online change point detection method. Our experiments on simulated data with varying change point patterns suggest a broad applicability of ETKA. On real-world data, ETKA outperforms comparison partners that differ regarding the model adjustment and its refitting trigger in nine respective ten out of 14 cases. These results confirm ETKA's ability to enable a more accurate and, in some settings, also more efficient data stream processing via Gaussian processes.Code availability: https://github.com/JanHuewel/ETKA
Mehr
Indrajit Nanda,
Sarah K Schröder,
Claus Steinlein,
Thomas Haaf,
Eva M. Buhl,
Prof. Dr. Dominik Grimm,
Ralf Weiskirchen
Hepatic stellate cells (HSCs) are also known as lipocytes, fat-storing cells, perisinusoidal cells, or Ito cells. These liver-specific mesenchymal cells represent about 5% to 8% of all liver cells, playing a key role in maintaining the microenvironment of the hepatic sinusoid. Upon chronic liver injury or in primary culture, these cells become activated and transdifferentiate into a contractile phenotype, i.e., the myofibroblast, capable of producing and secreting large quantities of extracellular matrix compounds. Based on their central role in the initiation and progression of chronic liver diseases, cultured HSCs are valuable in vitro tools to study molecular and cellular aspects of liver diseases. However, the isolation of these cells requires special equipment, trained personnel, and in some cases needs approval from respective authorities. To overcome these limitations, several immortalized HSC lines were established. One of these cell lines is CFSC, which was originally established from cirrhotic rat livers induced by carbon tetrachloride. First introduced in 1991, this cell line and derivatives thereof (i.e., CFSC-2G, CFSC-3H, CFSC-5H, and CFSC-8B) are now used in many laboratories as an established in vitro HSC model. We here describe molecular features that are suitable for cell authentication. Importantly, chromosome banding and multicolor spectral karyotyping (SKY) analysis demonstrate that the CFSC-2G genome has accumulated extensive chromosome rearrangements and most chromosomes exist in multiple copies producing a pseudo-triploid karyotype. Furthermore, our study documents a defined short tandem repeat (STR) profile including 31 species-specific markers, and a list of genes expressed in CFSC-2G established by bulk mRNA next-generation sequencing (NGS).
Mehr
Nikita Genze,
Raymond Ajekwe,
Zeynep Güreli,
Prof. Dr. Florian Haselbeck,
Michael Grieb,
Prof. Dr. Dominik Grimm
Weeds are undesired plants in agricultural fields that affect crop yield and quality by competing for nutrients, water, sunlight and space. For centuries, farmers have used several strategies and resources to remove weeds. The use of herbicide is still the most common control strategy. To reduce the amount of herbicide and impact caused by uniform spraying, site-specific weed management (SSWM) through variable rate herbicide application and mechanical weed control have long been recommended. To implement such precise strategies, accurate detection and classification of weeds in crop fields is a crucial first step. Due to the phenotypic similarity between some weeds and crops as well as changing weather conditions, it is challenging to design an automated system for general weed detection. For efficiency, unmanned aerial vehicles (UAV) are commonly used for image capturing. However, high wind pressure and different drone settings have a severe effect on the capturing quality, what potentially results in degraded images, e.g., due to motion blur. In this paper, we investigate the generalization capabilities of Deep Learning methods for early weed detection in sorghum fields under such challenging capturing conditions. For this purpose, we developed weed segmentation models using three different state-of-the-art Deep Learning architectures in combination with residual neural networks as feature extractors.We further publish a manually annotated and expert-curated UAV imagery dataset for weed detection in sorghum fields under challenging conditions. Our results show that our trained models generalize well regarding the detection of weeds, even for degraded captures due to motion blur. An UNet-like architecture with a ResNet-34 feature extractor achieved an F1-score of over 89 % on a hold-out test-set. Further analysis indicate that the trained model performed well in predicting the general plant shape, while most misclassifications appeared at borders of the plants. Beyond that, our approach can detect intra-row weeds without additional information as well as partly occluded plants in contrast to existing research.All data, including the newly generated and annotated UAV imagery dataset, and code is publicly available on GitHub: https://github.com/grimmlab/UAVWeedSegmentation and Mendeley Data: https://doi.org/10.17632/4hh45vkp38.3
Mehr
Maura John,
Prof. Dr. Florian Haselbeck,
Rupashree Dass,
Christoph Malisi,
Christian Dreischer,
Sebastian J Schultheiss,
Prof. Dr. Dominik Grimm
Genomic selection is an integral tool for breeders to accurately select plants directly from genotype data leading to faster and more resource-efficient breeding programs. Several prediction methods have been established in the last few years. These range from classical linear mixed models to complex non-linear machine learning approaches, such as Support Vector Regression, and modern deep learning-based architectures. Many of these methods have been extensively evaluated on different crop species with varying outcomes. In this work, our aim is to systematically compare twelve different phenotype prediction models, including basic genomic selection methods to more advanced deep learning-based techniques. More importantly, we assess the performance of these models on simulated phenotype data as well as on real-world data from Arabidopsis thaliana and two breeding datasets from soy and corn. The synthetic phenotypic data allows us to analyze all prediction models and especially the selected markers under controlled and predefined settings. We show that Bayes B and linear regression models with sparsity constraints perform best under different simulation settings with respect to explained variance. Further, we can confirm results from other studies that there is no superiority of more complex neural network-based architectures for phenotype prediction compared to well established methods. However, on real-world data, for which several prediction models yield comparable results with slight advantages for Elastic Net, this picture is less clear, suggesting that there is a lot of room for future research.
Mehr
Jan D Hüwel,
Prof. Dr. Florian Haselbeck,
Prof. Dr. Dominik Grimm,
Christian Beecks
One of the major challenges in time series analysis are changing data distributions, especially when processing data streams. To ensure an up-to-date model delivering useful predictions at all times, model reconfigurations are required to adapt to such evolving streams. For Gaussian processes, this might require the adaptation of the internal kernel expression. In this paper, we present dynamically self-adjusting Gaussian processes by introducing Event Triggered Kernel Adjustments in Gaussian process modelling (ETKA), a novel data stream modelling algorithm that can handle evolving and changing data distributions. To this end, we enhance the recently introduced Adjusting Kernel Search with a novel online change point detection method. Our experiments on simulated data with varying change point patterns suggest a broad applicability of ETKA. On real-world data, ETKA outperforms comparison partners that differ regarding the model adjustment and its refitting trigger in nine respective ten out of 14 cases. These results confirm ETKA's ability to enable a more accurate and, in some settings, also more efficient data stream processing via Gaussian processes.Code availability: https://github.com/JanHuewel/ETKA
Mehr
Prof. Dr. Dominik Grimm
Towards a better understanding of the genetic architecture of complex traits (2022) Keynote @TüBMI 2022, Tübinger Bioinformatics and Medical Informatics Days 2022 .
Quirin Göttl,
Prof. Dr. Dominik Grimm,
Prof. Dr.-Ing. Jakob Burger
The present work uses reinforcement learning (RL) for automated flowsheet synthesis. The task of synthesizing a flowsheet is reformulated into a two-player game, in which an agent learns by self-play without prior knowledge. The hierarchical RL scheme developed in our previous work (Göttl et al., 2021b) is coupled with an improved training process. The training process is analyzed in detail using the synthesis of ethyl tert-butyl ether (ETBE) as an example. This analysis uncovers how the agent’s evolution is driven by the two-player setup.
Mehr
M.Sc. Sara Omranian,
Zoran Nikoloski,
Prof. Dr. Dominik Grimm
Physically interacting proteins form macromolecule complexes that drive diverse cellular processes. Advances in experimental techniques that capture interactions between proteins provide us with protein-protein interaction (PPI) networks from several model organisms. These datasets have enabled the prediction and other computational analyses of protein complexes. Here we provide a systematic review of the state-of-the-art algorithms for protein complex prediction from PPI networks proposed in the past two decades. The existing approaches that solve this problem are categorized into three groups, including: cluster-quality-based, node affinity-based, and network embedding-based approaches, and we compare and contrast the advantages and disadvantages. We further include a comparative analysis by computing the performance of eighteen methods based on twelve well-established performance measures on four widely used benchmark protein-protein interaction networks. Finally, the limitations and drawbacks of both, current data and approaches, along with the potential solutions in this field are discussed, with emphasis on the points that pave the way for future research efforts in this field.
Mehr
Betreuung der Publikationsseiten
Gerhard Radlmayr
Referent für Wissenstransfer und Forschungskommunikation
Wir verwenden Cookies. Einige sind notwendig für die Funktion der Webseite, andere helfen uns, die Webseite zu verbessern. Um unseren eigenen Ansprüchen beim Datenschutz gerecht zu werden, erfassen wir lediglich anonymisierte Nutzerdaten mit „Matomo“. Um unser Internetangebot für Sie ansprechender zu gestalten, binden wir außerdem externe Inhalte unserer Social-Media-Kanäle ein.