2024
@conference{gravina2024tgode,
title = {Temporal Graph ODEs for Irregularly-Sampled Time Series},
author = {Alessio Gravina and Daniele Zambon and Davide Bacciu and Cesare Alippi},
url = {https://arxiv.org/abs/2404.19508, Arxiv},
year = {2024},
date = {2024-08-09},
urldate = {2024-08-09},
booktitle = {Proceedings of the International Joint Conference on Artificial Intelligence (IJCAI 2024)},
abstract = {Modern graph representation learning works mostly under the assumption of dealing with regularly sampled temporal graph snapshots, which is far from realistic, e.g., social networks and physical systems are characterized by continuous dynamics and sporadic observations. To address this limitation, we introduce the Temporal Graph Ordinary Differential Equation (TG-ODE) framework, which learns both the temporal and spatial dynamics from graph streams where the intervals between observations are not regularly spaced. We empirically validate the proposed approach on several graph benchmarks, showing that TG-ODE can achieve state-of-the-art performance in irregular graph stream tasks.},
keywords = {},
pubstate = {published},
tppubtype = {conference}
}
@inproceedings{massidda2024learning,
title = {Learning Causal Abstractions of Linear Structural Causal Models},
author = {Riccardo Massidda and Sara Magliacane and Davide Bacciu},
url = {https://openreview.net/forum?id=XlFqI9TMhf},
year = {2024},
date = {2024-07-31},
urldate = {2024-07-31},
booktitle = {The 40th Conference on Uncertainty in Artificial Intelligence},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
@conference{gravina2024ctan,
title = {Long Range Propagation on Continuous-Time Dynamic Graphs},
author = {Alessio Gravina and Giulio Lovisotto and Claudio Gallicchio and Davide Bacciu and Claas Grohnfeldt},
year = {2024},
date = {2024-07-24},
urldate = {2024-07-24},
booktitle = {Proceedings of the International Conference on Machine Learning (ICML 2024)},
publisher = {PMLR},
abstract = {Learning Continuous-Time Dynamic Graphs (C-TDGs) requires accurately modeling spatio-temporal information on streams of irregularly sampled events. While many methods have been proposed recently, we find that most message passing-, recurrent- or self-attention-based methods perform poorly on long-range tasks. These tasks require correlating information that occurred "far" away from the current event, either spatially (higher-order node information) or along the time dimension (events occurred in the past). To address long-range dependencies, we introduce Continuous-Time Graph Anti-Symmetric Network (CTAN). Grounded within the ordinary differential equations framework, our method is designed for efficient propagation of information. In this paper, we show how CTAN's (i) long-range modeling capabilities are substantiated by theoretical findings and how (ii) its empirical performance on synthetic long-range benchmarks and real-world benchmarks is superior to other methods. Our results motivate CTAN's ability to propagate long-range information in C-TDGs as well as the inclusion of long-range tasks as part of temporal graph models evaluation.},
keywords = {},
pubstate = {published},
tppubtype = {conference}
}
@inproceedings{bacciu2024generalizing,
title = {Generalizing Convolution to Point Clouds},
author = {Davide Bacciu and Francesco Landolfi},
url = {https://openreview.net/forum?id=TXwDtUmiaj},
year = {2024},
date = {2024-07-23},
urldate = {2024-01-01},
booktitle = {ICML 2024 Workshop on Differentiable Almost Everything: Differentiable Relaxations, Algorithms, Operators, and Simulators},
abstract = {Convolution, a fundamental operation in deep learning for structured grid data like images, cannot be directly applied to point clouds due to their irregular and unordered nature. Many approaches in literature that perform convolution on point clouds achieve this by designing a convolutional operator from scratch, often with little resemblance to the one used on images. We present two point cloud convolutions that naturally follow from the convolution in its standard definition popular with images. We do so by relaxing the indexing of the kernel weights with a "soft" dictionary that resembles the attention mechanism of the transformers. Finally, experimental results demonstrate the effectiveness of the proposed relaxations on two benchmark point cloud classification tasks.},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
@inproceedings{trenta2024multistop,
title = {MultiSTOP: Solving Functional Equations with Reinforcement Learning},
author = {Alessandro Trenta and Davide Bacciu and Andrea Cossu and Pietro Ferrero},
year = {2024},
date = {2024-05-11},
urldate = {2024-05-11},
booktitle = {ICLR 2024 Workshop on AI4DifferentialEquations In Science},
abstract = {We develop MultiSTOP, a Reinforcement Learning framework for solving functional equations in physics. This new methodology produces actual numerical solutions instead of bounds on them. We extend the original BootSTOP algorithm by adding multiple constraints derived from domain-specific knowledge, even in integral form, to improve the accuracy of the solution. We investigate a particular equation in a one-dimensional Conformal Field Theory.},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
@conference{cosmo2024,
title = {Constraint-Free Structure Learning with Smooth Acyclic Orientations},
author = {Riccardo Massidda and Francesco Landolfi and Martina Cinquini and Davide Bacciu},
url = {https://openreview.net/forum?id=KWO8LSUC5W},
year = {2024},
date = {2024-05-06},
urldate = {2024-01-01},
booktitle = {The Twelfth International Conference on Learning Representations},
abstract = {The structure learning problem consists of fitting data generated by a Directed Acyclic Graph (DAG) to correctly reconstruct its arcs. In this context, differentiable approaches constrain or regularize an optimization problem with a continuous relaxation of the acyclicity property. The computational cost of evaluating graph acyclicity is cubic on the number of nodes and significantly affects scalability. In this paper, we introduce COSMO, a constraint-free continuous optimization scheme for acyclic structure learning. At the core of our method lies a novel differentiable approximation of an orientation matrix parameterized by a single priority vector. Differently from previous works, our parameterization fits a smooth orientation matrix and the resulting acyclic adjacency matrix without evaluating acyclicity at any step. Despite this absence, we prove that COSMO always converges to an acyclic solution. In addition to being asymptotically faster, our empirical analysis highlights how COSMO performance on graph reconstruction compares favorably with competing structure learning methods.
},
keywords = {},
pubstate = {published},
tppubtype = {conference}
}
@conference{pasquali2024deepnetslice,
title = {Deep Reinforcement Learning for Network Slice Placement and the DeepNetSlice Toolkit},
author = {Alex Pasquali and Vincenzo Lomonaco and Davide Bacciu and Federica Paganelli},
year = {2024},
date = {2024-05-05},
urldate = {2024-05-05},
booktitle = {Proceedings of the IEEE International Conference on Machine Learning for Communication and Networking 2024 (IEEE ICMLCN 2024)},
publisher = {IEEE},
keywords = {},
pubstate = {forthcoming},
tppubtype = {conference}
}
@workshop{Ninniri2024,
title = {Classifier-free graph diffusion for molecular property targeting},
author = {Matteo Ninniri and Marco Podda and Davide Bacciu},
url = {https://arxiv.org/abs/2312.17397, Arxiv},
year = {2024},
date = {2024-02-27},
booktitle = {4th workshop on Graphs and more Complex structures for Learning and Reasoning (GCLR) at AAAI 2024},
abstract = {This work focuses on the task of property targeting: that is, generating molecules conditioned on target chemical properties to expedite candidate screening for novel drug and materials development. DiGress is a recent diffusion model for molecular graphs whose distinctive feature is allowing property targeting through classifier-based (CB) guidance. While CB guidance may work to generate molecular-like graphs, we hint at the fact that its assumptions apply poorly to the chemical domain. Based on this insight we propose a classifier-free DiGress (FreeGress), which works by directly injecting the conditioning information into the training process. CF guidance is convenient given its less stringent assumptions and since it does not require to train an auxiliary property regressor, thus halving the number of trainable parameters in the model. We empirically show that our model yields up to 79% improvement in Mean Absolute Error with respect to DiGress on property targeting tasks on QM9 and ZINC-250k benchmarks. As an additional contribution, we propose a simple yet powerful approach to improve chemical validity of generated samples, based on the observation that certain chemical properties such as molecular weight correlate with the number of atoms in molecules. },
keywords = {},
pubstate = {published},
tppubtype = {workshop}
}
@inproceedings{10.1007/978-3-031-66535-6_27,
title = {Quasi-Orthogonal ECG-Frank XYZ Transformation with Energy-Based Models and Clinical Text},
author = {Lorenzo Simone and Davide Bacciu and Vincenzo Gervasi},
editor = {Joseph Finkelstein and Robert Moskovitch and Enea Parimbelli},
isbn = {978-3-031-66535-6},
year = {2024},
date = {2024-01-01},
booktitle = {Artificial Intelligence in Medicine},
pages = {249–253},
publisher = {Springer Nature Switzerland},
address = {Cham},
abstract = {The transformation of 12-Lead electrocardiograms to 3D vectorcardiograms, along with its reverse process, offer numerous advantages for computer visualization, signal transmission and analysis. Recent literature has shown increasing interest in this structured representation, due to its effectiveness in various cardiac evaluations and machine learning-based arrhythmia prediction. Current transformation techniques utilize fixed matrices, often retrieved through regression methods which fail to correlate with patient's physical characteristics or ongoing diseases. In this paper, we propose the first quasi-orthogonal transformation handling multi-modal input (12-lead ECG and clinical annotations) through a conditional energy-based model. Within our novel probabilistic formulation, the model proposes multiple transformation coefficients without relying on a single fixed approximation to better highlight relationships between latent factors and structured output. The evaluation of our approach, conducted with a nested cross validation on PTB Diagnostic dataset, showcased improved reconstruction precision across various cardiac conditions compared to state-of-the-art techniques (Kors, Dower, and QSLV).},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
@article{CARTA2024127935,
title = {Projected Latent Distillation for Data-Agnostic Consolidation in distributed continual learning},
author = {Antonio Carta and Andrea Cossu and Vincenzo Lomonaco and Davide Bacciu and Joost Weijer},
url = {https://www.sciencedirect.com/science/article/pii/S0925231224007069},
doi = {10.1016/j.neucom.2024.127935},
issn = {0925-2312},
year = {2024},
date = {2024-01-01},
urldate = {2024-01-01},
journal = {Neurocomputing},
volume = {598},
pages = {127935},
abstract = {In continual learning applications on-the-edge multiple self-centered devices (SCD) learn different local tasks independently, with each SCD only optimizing its own task. Can we achieve (almost) zero-cost collaboration between different devices? We formalize this problem as a Distributed Continual Learning (DCL) scenario, where SCDs greedily adapt to their own local tasks and a separate continual learning (CL) model perform a sparse and asynchronous consolidation step that combines the SCD models sequentially into a single multi-task model without using the original data. Unfortunately, current CL methods are not directly applicable to this scenario. We propose Data-Agnostic Consolidation (DAC), a novel double knowledge distillation method which performs distillation in the latent space via a novel Projected Latent Distillation loss. Experimental results show that DAC enables forward transfer between SCDs and reaches state-of-the-art accuracy on Split CIFAR100, CORe50 and Split TinyImageNet, both in single device and distributed CL scenarios. Somewhat surprisingly, a single out-of-distribution image is sufficient as the only source of data for DAC.},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
@article{COSSU2024127960,
title = {Drifting explanations in continual learning},
author = {Andrea Cossu and Francesco Spinnato and Riccardo Guidotti and Davide Bacciu},
url = {https://www.sciencedirect.com/science/article/pii/S0925231224007318},
doi = {10.1016/j.neucom.2024.127960},
issn = {0925-2312},
year = {2024},
date = {2024-01-01},
urldate = {2024-01-01},
journal = {Neurocomputing},
volume = {597},
pages = {127960},
abstract = {Continual Learning (CL) trains models on streams of data, with the aim of learning new information without forgetting previous knowledge. However, many of these models lack interpretability, making it difficult to understand or explain how they make decisions. This lack of interpretability becomes even more challenging given the non-stationary nature of the data streams in CL. Furthermore, CL strategies aimed at mitigating forgetting directly impact the learned representations. We study the behavior of different explanation methods in CL and propose CLEX (ContinuaL EXplanations), an evaluation protocol to robustly assess the change of explanations in Class-Incremental scenarios, where forgetting is pronounced. We observed that models with similar predictive accuracy do not generate similar explanations. Replay-based strategies, well-known to be some of the most effective ones in class-incremental scenarios, are able to generate explanations that are aligned to the ones of a model trained offline. On the contrary, naive fine-tuning often results in degenerate explanations that drift from the ones of an offline model. Finally, we discovered that even replay strategies do not always operate at best when applied to fully-trained recurrent models. Instead, randomized recurrent models (leveraging on an untrained recurrent component) clearly reduce the drift of the explanations. This discrepancy between fully-trained and randomized recurrent models, previously known only in the context of their predictive continual performance, is more general, including also continual explanations.},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
@inproceedings{ceni2024random,
title = {Random Oscillators Network for Time Series Processing},
author = {Andrea Ceni and Andrea Cossu and Maximilian W Stölzle and Jingyue Liu and Cosimo Della Santina and Davide Bacciu and Claudio Gallicchio},
year = {2024},
date = {2024-01-01},
urldate = {2024-01-01},
booktitle = {International Conference on Artificial Intelligence and Statistics},
pages = {4807–4815},
organization = {PMLR},
abstract = {We introduce the Random Oscillators Network (RON), a physically-inspired recurrent model derived from a network of heterogeneous oscillators. Unlike traditional recurrent neural networks, RON keeps the connections between oscillators untrained by leveraging on smart random initialisations, leading to exceptional computational efficiency. A rigorous theoretical analysis finds the necessary and sufficient conditions for the stability of RON, highlighting the natural tendency of RON to lie at the edge of stability, a regime of configurations offering particularly powerful and expressive models. Through an extensive empirical evaluation on several benchmarks, we show four main advantages of RON. 1) RON shows excellent long-term memory and sequence classification ability, outperforming other randomised approaches. 2) RON outperforms fully-trained recurrent models and state-of-the-art randomised models in chaotic time series forecasting. 3) RON provides expressive internal representations even in a small parametrisation regime making it amenable to be deployed on low-powered devices and at the edge. 4) RON is up to two orders of magnitude faster than fully-trained models. },
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
@inproceedings{li2024calibration,
title = {Calibration of Continual Learning Models},
author = {Lanpei Li and Elia Piccoli and Andrea Cossu and Davide Bacciu and Vincenzo Lomonaco},
year = {2024},
date = {2024-01-01},
urldate = {2024-01-01},
booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
pages = {4160–4169},
abstract = {Continual Learning (CL) focuses on maximizing the predictive performance of a model across a non-stationary stream of data. Unfortunately CL models tend to forget previous knowledge thus often underperforming when compared with an offline model trained jointly on the entire data stream. Given that any CL model will eventually make mistakes it is of crucial importance to build calibrated CL models: models that can reliably tell their confidence when making a prediction. Model calibration is an active research topic in machine learning yet to be properly investigated in CL. We provide the first empirical study of the behavior of calibration approaches in CL showing that CL strategies do not inherently learn calibrated models. To mitigate this issue we design a continual calibration approach that improves the performance of post-processing calibration methods over a wide range of different benchmarks and CL strategies. CL does not necessarily need perfect predictive models but rather it can benefit from reliable predictive models. We believe our study on continual calibration represents a first step towards this direction.},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
@article{10490120,
title = {Deep Learning for Dynamic Graphs: Models and Benchmarks},
author = {Alessio Gravina and Davide Bacciu},
doi = {10.1109/TNNLS.2024.3379735},
year = {2024},
date = {2024-01-01},
urldate = {2024-01-01},
journal = {IEEE Transactions on Neural Networks and Learning Systems},
pages = {1-14},
abstract = {Recent progress in research on deep graph networks (DGNs) has led to a maturation of the domain of learning on graphs. Despite the growth of this research field, there are still important challenges that are yet unsolved. Specifically, there is an urge of making DGNs suitable for predictive tasks on real-world systems of interconnected entities, which evolve over time. With the aim of fostering research in the domain of dynamic graphs, first, we survey recent advantages in learning both temporal and spatial information, providing a comprehensive overview of the current state-of-the-art in the domain of representation learning for dynamic graphs. Second, we conduct a fair performance comparison among the most popular proposed approaches on node-and edge-level tasks, leveraging rigorous model selection and assessment for all the methods, thus establishing a sound baseline for evaluating new architectures and approaches.},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
@article{10492646,
title = {IEEE Transactions on Neural Networks and Learning Systems Special Issue on Causal Discovery and Causality-Inspired Machine Learning},
author = {Kun Zhang and Ilya Shpitser and Sara Magliacane and Davide Bacciu and Fei Wu and Changshui Zhang and Peter Spirtes},
doi = {10.1109/TNNLS.2024.3365968},
year = {2024},
date = {2024-01-01},
urldate = {2024-01-01},
journal = {IEEE Transactions on Neural Networks and Learning Systems},
volume = {35},
number = {4},
pages = {4899-4901},
abstract = {Causality is a fundamental notion in science and engineering. It has attracted much interest across research communities in statistics, machine learning (ML), healthcare, and artificial intelligence (AI), and is becoming increasingly recognized as a vital research area. One of the fundamental problems in causality is how to find the causal structure or the underlying causal model. Accordingly, one focus of this Special Issue is on causal discovery, i.e., how can we discover causal structure over a set of variables from observational data with automated procedures? Besides learning causality, another focus is on using causality to help understand and advance ML, that is, causality-inspired ML.},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
@inproceedings{resta2024self,
title = {Self-generated Replay Memories for Continual Neural Machine Translation},
author = {Michele Resta and Davide Bacciu},
year = {2024},
date = {2024-01-01},
urldate = {2024-01-01},
booktitle = {Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers)},
pages = {175–191},
abstract = {Modern Neural Machine Translation systems exhibit strong performance in several different languages and are constantly improving. Their ability to learn continuously is, however, still severely limited by the catastrophic forgetting issue. In this work, we leverage a key property of encoder-decoder Transformers, i.e. their generative ability, to propose a novel approach to continually learning Neural Machine Translation systems. We show how this can effectively learn on a stream of experiences comprising different languages, by leveraging a replay memory populated by using the model itself as a generator of parallel sentences. We empirically demonstrate that our approach can counteract catastrophic forgetting without requiring explicit memorization of training data. Code will be publicly available upon publication. Code: https://github.com/m-resta/sg-rep},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
@article{COSSU2024106492,
title = {Continual pre-training mitigates forgetting in language and vision},
author = {Andrea Cossu and Antonio Carta and Lucia Passaro and Vincenzo Lomonaco and Tinne Tuytelaars and Davide Bacciu},
url = {https://www.sciencedirect.com/science/article/pii/S0893608024004167},
doi = {10.1016/j.neunet.2024.106492},
issn = {0893-6080},
year = {2024},
date = {2024-01-01},
urldate = {2024-01-01},
journal = {Neural Networks},
volume = {179},
pages = {106492},
abstract = {Pre-trained models are commonly used in Continual Learning to initialize the model before training on the stream of non-stationary data. However, pre-training is rarely applied during Continual Learning. We investigate the characteristics of the Continual Pre-Training scenario, where a model is continually pre-trained on a stream of incoming data and only later fine-tuned to different downstream tasks. We introduce an evaluation protocol for Continual Pre-Training which monitors forgetting against a Forgetting Control dataset not present in the continual stream. We disentangle the impact on forgetting of 3 main factors: the input modality (NLP, Vision), the architecture type (Transformer, ResNet) and the pre-training protocol (supervised, self-supervised). Moreover, we propose a Sample-Efficient Pre-training method (SEP) that speeds up the pre-training phase. We show that the pre-training protocol is the most important factor accounting for forgetting. Surprisingly, we discovered that self-supervised continual pre-training in both NLP and Vision is sufficient to mitigate forgetting without the use of any Continual Learning strategy. Other factors, like model depth, input modality and architecture type are not as crucial.},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
2023
@article{lepri2023neural,
title = {Neural Autoencoder-Based Structure-Preserving Model Order Reduction and Control Design for High-Dimensional Physical Systems},
author = {Marco Lepri and Davide Bacciu and Cosimo Della Santina},
year = {2023},
date = {2023-12-21},
urldate = {2023-01-01},
journal = {IEEE Control Systems Letters},
publisher = {IEEE},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
@inproceedings{georgiev2024neural,
title = {Neural algorithmic reasoning for combinatorial optimisation},
author = {Dobrik Georgiev Georgiev and Danilo Numeroso and Davide Bacciu and Pietro Liò},
year = {2023},
date = {2023-12-15},
urldate = {2023-12-15},
booktitle = {Learning on Graphs Conference},
pages = {28–1},
organization = {PMLR},
abstract = {Solving NP-hard/complete combinatorial problems with neural networks is a challenging research area that aims to surpass classical approximate algorithms. The long-term objective is to outperform hand-designed heuristics for NP-hard/complete problems by learning to generate superior solutions solely from training data. Current neural-based methods for solving CO problems often overlook the inherent "algorithmic" nature of the problems. In contrast, heuristics designed for CO problems, e.g., TSP, frequently leverage well-established algorithms, such as those for finding the minimum spanning tree. In this paper, we propose leveraging recent advancements in neural algorithmic reasoning to improve the learning of CO problems. Specifically, we suggest pre-training our neural model on relevant algorithms before training it on CO instances. Our results demonstrate that, using this learning setup, we achieve superior performance compared to non-algorithmically informed deep learning models.},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
@workshop{Gravina2023b,
title = {Effective Non-Dissipative Propagation for Continuous-Time Dynamic Graphs},
author = {Alessio Gravina and Giulio Lovisotto and Claudio Gallicchio and Davide Bacciu and Claas Grohnfeldt},
url = {https://openreview.net/forum?id=zAHFC2LNEe, PDF},
year = {2023},
date = {2023-12-11},
urldate = {2023-12-11},
booktitle = {Temporal Graph Learning Workshop, NeurIPS 2023},
abstract = {Recent research on Deep Graph Networks (DGNs) has broadened the domain of learning on graphs to real-world systems of interconnected entities that evolve over time. This paper addresses prediction problems on graphs defined by a stream of events, possibly irregularly sampled over time, generally referred to as Continuous-Time Dynamic Graphs (C-TDGs). While many predictive problems on graphs may require capturing interactions between nodes at different distances, existing DGNs for C-TDGs are not designed to propagate and preserve long-range information - resulting in suboptimal performance. In this work, we present Continuous-Time Graph Anti-Symmetric Network (CTAN), a DGN for C-TDGs designed within the ordinary differential equations framework that enables efficient propagation of long-range dependencies. We show that our method robustly performs stable and non-dissipative information propagation over dynamically evolving graphs, where the number of ODE discretization steps allows scaling the propagation range. We empirically validate the proposed approach on several real and synthetic graph benchmarks, showing that CTAN leads to improved performance while enabling the propagation of long-range information},
keywords = {},
pubstate = {published},
tppubtype = {workshop}
}
@article{errica2023pydgn,
title = {PyDGN: a Python Library for Flexible and Reproducible Research on Deep Learning for Graphs},
author = {Federico Errica and Davide Bacciu and Alessio Micheli},
year = {2023},
date = {2023-10-31},
urldate = {2023-01-01},
journal = {Journal of Open Source Software},
volume = {8},
number = {90},
pages = {5713},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
@conference{Errica2023,
title = {Hidden Markov Models for Temporal Graph Representation Learning},
author = {Federico Errica and Alessio Gravina and Davide Bacciu and Alessio Micheli},
editor = {Michel Verleysen},
year = {2023},
date = {2023-10-04},
urldate = {2023-10-04},
booktitle = {Proceedings of the 31st European Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning},
keywords = {},
pubstate = {published},
tppubtype = {conference}
}
@conference{Landolfi2023,
title = {A Tropical View of Graph Neural Networks},
author = {Francesco Landolfi and Davide Bacciu and Danilo Numeroso},
editor = {Michel Verleysen},
year = {2023},
date = {2023-10-04},
urldate = {2023-10-04},
booktitle = {Proceedings of the 31st European Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning},
keywords = {},
pubstate = {published},
tppubtype = {conference}
}
@conference{Ceni2023,
title = {Improving Fairness via Intrinsic Plasticity in Echo State Networks},
author = {Andrea Ceni and Davide Bacciu and Valerio De Caro and Claudio Gallicchio and Luca Oneto},
editor = {Michel Verleysen},
year = {2023},
date = {2023-10-04},
urldate = {2023-10-04},
booktitle = {Proceedings of the 31st European Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning},
keywords = {},
pubstate = {published},
tppubtype = {conference}
}
@conference{Cossu2023,
title = {A Protocol for Continual Explanation of SHAP},
author = {Andrea Cossu and Francesco Spinnato and Riccardo Guidotti and Davide Bacciu},
editor = {Michel Verleysen},
year = {2023},
date = {2023-10-04},
urldate = {2023-10-04},
booktitle = {Proceedings of the 31st European Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning},
keywords = {},
pubstate = {published},
tppubtype = {conference}
}
@conference{Caro2023,
title = {Communication-Efficient Ridge Regression in Federated Echo State Networks},
author = {Valerio De Caro and Antonio Di Mauro and Davide Bacciu and Claudio Gallicchio},
editor = {Michel Verleysen},
year = {2023},
date = {2023-10-04},
urldate = {2023-10-04},
booktitle = {Proceedings of the 31st European Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning},
keywords = {},
pubstate = {published},
tppubtype = {conference}
}
@conference{Bacciu2023c,
title = {Graph Representation Learning},
author = {Davide Bacciu and Federico Errica and Alessio Micheli and Nicolò Navarin and Luca Pasa and Marco Podda and Daniele Zambon},
editor = {Michel Verleysen},
year = {2023},
date = {2023-10-04},
urldate = {2023-10-04},
booktitle = {Proceedings of the 31st European Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning},
keywords = {},
pubstate = {published},
tppubtype = {conference}
}
@workshop{Ceni2023c,
title = {Randomly Coupled Oscillators},
author = {Andrea Ceni and Andrea Cossu and Jingyue Liu and Maximilian Stölzle and Cosimo Della Santina and Claudio Gallicchio and Davide Bacciu},
year = {2023},
date = {2023-09-18},
booktitle = {Proceedings of the ECML/PKDD Workshop on Deep Learning meets Neuromorphic Hardware},
keywords = {},
pubstate = {published},
tppubtype = {workshop}
}
@workshop{Gravina2023c,
title = {Non-Dissipative Propagation by Randomized Anti-Symmetric Deep Graph Networks},
author = {Alessio Gravina and Claudio Gallicchio and Davide Bacciu},
year = {2023},
date = {2023-09-18},
urldate = {2023-09-18},
booktitle = {Proceedings of the ECML/PKDD Workshop on Deep Learning meets Neuromorphic Hardware},
keywords = {},
pubstate = {published},
tppubtype = {workshop}
}
@conference{Cosenza2023,
title = {Graph-based Polyphonic Multitrack Music Generation},
author = {Emanuele Cosenza and Andrea Valenti and Davide Bacciu },
year = {2023},
date = {2023-08-19},
urldate = {2023-08-19},
booktitle = {Proceedings of the 32nd International Joint Conference on Artificial Intelligence (IJCAI 2023)},
keywords = {},
pubstate = {published},
tppubtype = {conference}
}
@conference{Hemati2023,
title = {Partial Hypernetworks for Continual Learning},
author = {Hamed Hemati and Vincenzo Lomonaco and Davide Bacciu and Damian Borth},
year = {2023},
date = {2023-08-01},
urldate = {2023-08-01},
booktitle = {Proceedings of the International Conference on Lifelong Learning Agents (CoLLAs 2023)},
publisher = {Proceedings of Machine Learning Research},
keywords = {},
pubstate = {published},
tppubtype = {conference}
}
@conference{Hemati2023b,
title = {Class-Incremental Learning with Repetition},
author = {Hamed Hemati and Andrea Cossu and Antonio Carta and Julio Hurtado and Lorenzo Pellegrini and Davide Bacciu and Vincenzo Lomonaco and Damian Borth},
year = {2023},
date = {2023-08-01},
urldate = {2023-08-01},
booktitle = {Proceedings of the International Conference on Lifelong Learning Agents (CoLLAs 2023)},
publisher = {Proceedings of Machine Learning Research},
keywords = {},
pubstate = {published},
tppubtype = {conference}
}
@workshop{decaro2023decentralized,
title = {Decentralized Plasticity in Reservoir Dynamical Networks for Pervasive Environments},
author = {Valerio De Caro and Davide Bacciu and Claudio Gallicchio},
url = {https://openreview.net/forum?id=5hScPOeDaR, PDF},
year = {2023},
date = {2023-07-29},
urldate = {2023-07-29},
booktitle = {Proceedings of the 2023 ICML Workshop on Localized Learning: Decentralized Model Updates via Non-Global Objectives},
keywords = {},
pubstate = {published},
tppubtype = {workshop}
}
@workshop{Ceni2023b,
title = {Randomly Coupled Oscillators for Time Series Processing},
author = {Andrea Ceni and Andrea Cossu and Jingyue Liu and Maximilian Stölzle and Cosimo Della Santina and Claudio Gallicchio and Davide Bacciu},
url = {https://openreview.net/forum?id=fmn7PMykEb, PDF},
year = {2023},
date = {2023-07-28},
urldate = {2023-07-28},
booktitle = {Proceedings of the 2023 ICML Workshop on New Frontiers in Learning, Control, and Dynamical Systems },
keywords = {},
pubstate = {published},
tppubtype = {workshop}
}
@workshop{Massidda2023b,
title = {Differentiable Causal Discovery with Smooth Acyclic Orientations},
author = {Riccardo Massidda and Francesco Landolfi and Martina Cinquini and Davide Bacciu},
url = {https://openreview.net/forum?id=IVwWgscehR, PDF},
year = {2023},
date = {2023-07-28},
urldate = {2023-07-28},
booktitle = {Proceedings of the 2023 ICML Workshop on Differentiable Almost Everything: Differentiable Relaxations, Algorithms, Operators, and Simulators },
keywords = {},
pubstate = {published},
tppubtype = {workshop}
}
@conference{simone2023ecgan,
title = {ECGAN: generative adversarial network for electrocardiography},
author = {Lorenzo Simone and Davide Bacciu },
year = {2023},
date = {2023-06-12},
urldate = {2023-06-12},
booktitle = {Proceedings of Artificial Intelligence In Medicine 2023 (AIME 2023)},
keywords = {},
pubstate = {published},
tppubtype = {conference}
}
@conference{Lomonaco2023,
title = {AI-Toolkit: a Microservices Architecture for Low-Code Decentralized Machine Intelligence},
author = {Vincenzo Lomonaco and Valerio De Caro and Claudio Gallicchio and Antonio Carta and Christos Sardianos and Iraklis Varlamis and Konstantinos Tserpes and Massimo Coppola and Mina Marpena and Sevasti Politi and Erwin Schoitsch and Davide Bacciu},
year = {2023},
date = {2023-06-04},
urldate = {2023-06-04},
booktitle = {Proceedings of 2023 IEEE International Conference on Acoustics, Speech and Signal Processing},
abstract = {Artificial Intelligence and Machine Learning toolkits such as Scikit-learn, PyTorch and Tensorflow provide today a solid starting point for the rapid prototyping of R&D solutions. However, they can be hardly ported to heterogeneous decentralised hardware and real-world production environments. A common practice involves outsourcing deployment solutions to scalable cloud infrastructures such as Amazon SageMaker or Microsoft Azure. In this paper, we proposed an open-source microservices-based architecture for decentralised machine intelligence which aims at bringing R&D and deployment functionalities closer following a low-code approach. Such an approach would guarantee flexible integration of cutting-edge functionalities while preserving complete control over the deployed solutions at negligible costs and maintenance efforts.},
keywords = {},
pubstate = {published},
tppubtype = {conference}
}
@conference{DeCaro2023,
title = {Prediction of Driver's Stress Affection in Simulated Autonomous Driving Scenarios},
author = {Valerio De Caro and Herbert Danzinger and Claudio Gallicchio and Clemens Könczöl and Vincenzo Lomonaco and Mina Marmpena and Sevasti Politi and Omar Veledar and Davide Bacciu},
year = {2023},
date = {2023-06-04},
urldate = {2023-06-04},
booktitle = {Proceedings of 2023 IEEE International Conference on Acoustics, Speech and Signal Processing},
abstract = {We investigate the task of predicting stress affection from physiological data of users experiencing simulations of autonomous driving. We approach this task on two levels of granularity, depending on whether the prediction is performed at the end of the simulation, or along the simulation. In the former, denoted as coarse-grained prediction, we employed Decision Trees. In the latter, denoted as fine-grained prediction, we employed Echo State Networks, a Recurrent Neural Network that allows efficient learning from temporal data and hence is suitable for pervasive environments. We conduct experiments on a private dataset of physiological data from people participating in multiple driving scenarios simulating different stressful events. The results show that the proposed model is capable of detecting conditions of event-related cognitive stress, proving the existence of a correlation between stressful events and the physiological data.},
keywords = {},
pubstate = {published},
tppubtype = {conference}
}
@conference{Gravina2023,
title = {Anti-Symmetric DGN: a stable architecture for Deep Graph Networks},
author = {Alessio Gravina and Davide Bacciu and Claudio Gallicchio},
url = {https://openreview.net/pdf?id=J3Y7cgZOOS},
year = {2023},
date = {2023-05-01},
urldate = {2023-05-01},
booktitle = {Proceedings of the Eleventh International Conference on Learning Representations (ICLR 2023) },
abstract = {Deep Graph Networks (DGNs) currently dominate the research landscape of learning from graphs, due to their efficiency and ability to implement an adaptive message-passing scheme between the nodes. However, DGNs are typically limited in their ability to propagate and preserve long-term dependencies between nodes, i.e., they suffer from the over-squashing phenomena. As a result, we can expect them to under-perform, since different problems require to capture interactions at different (and possibly large) radii in order to be effectively solved. In this work, we present Anti-Symmetric Deep Graph Networks (A-DGNs), a framework for stable and non-dissipative DGN design, conceived through the lens of ordinary differential equations. We give theoretical proof that our method is stable and non-dissipative, leading to two key results: long-range information between nodes is preserved, and no gradient vanishing or explosion occurs in training. We empirically validate the proposed approach on several graph benchmarks, showing that A-DGN yields to improved performance and enables to learn effectively even when dozens of layers are used.},
keywords = {},
pubstate = {published},
tppubtype = {conference}
}
@conference{Numeroso2023,
title = {Dual Algorithmic Reasoning},
author = {Danilo Numeroso and Davide Bacciu and Petar Veličković},
url = {https://openreview.net/pdf?id=hhvkdRdWt1F},
year = {2023},
date = {2023-05-01},
urldate = {2023-05-01},
booktitle = {Proceedings of the Eleventh International Conference on Learning Representations (ICLR 2023)},
abstract = {Neural Algorithmic Reasoning is an emerging area of machine learning which seeks to infuse algorithmic computation in neural networks, typically by training neural models to approximate steps of classical algorithms. In this context, much of the current work has focused on learning reachability and shortest path graph algorithms, showing that joint learning on similar algorithms is beneficial for generalisation. However, when targeting more complex problems, such "similar" algorithms become more difficult to find. Here, we propose to learn algorithms by exploiting duality of the underlying algorithmic problem. Many algorithms solve optimisation problems. We demonstrate that simultaneously learning the dual definition of these optimisation problems in algorithmic learning allows for better learning and qualitatively better solutions. Specifically, we exploit the max-flow min-cut theorem to simultaneously learn these two algorithms over synthetically generated graphs, demonstrating the effectiveness of the proposed approach. We then validate the real-world utility of our dual algorithmic reasoner by deploying it on a challenging brain vessel classification task, which likely depends on the vessels’ flow properties. We demonstrate a clear performance gain when using our model within such a context, and empirically show that learning the max-flow and min-cut algorithms together is critical for achieving such a result.},
note = {Notable Spotlight paper},
keywords = {},
pubstate = {published},
tppubtype = {conference}
}
@conference{Massidda2023,
title = {Causal Abstraction with Soft Interventions},
author = {Riccardo Massidda and Atticus Geiger and Thomas Icard and Davide Bacciu},
year = {2023},
date = {2023-04-17},
urldate = {2023-04-17},
booktitle = {Proceedings of the 2nd Conference on Causal Learning and Reasoning (CLeaR 2023)},
publisher = {PMLR},
keywords = {},
pubstate = {published},
tppubtype = {conference}
}
@workshop{gravina2023nondissipative,
title = {Non-Dissipative Propagation by Anti-Symmetric Deep Graph Networks},
author = {Alessio Gravina and Davide Bacciu and Claudio Gallicchio},
url = {https://drive.google.com/file/d/1uPHhjwSa3g_hRvHwx6UnbMLgGN_cAqMu/view, PDF},
year = {2023},
date = {2023-02-13},
urldate = {2023-02-13},
booktitle = {Proceedings of the Ninth International Workshop on Deep Learning on Graphs: Method and Applications (DLG-AAAI’23)},
abstract = {Deep Graph Networks (DGNs) currently dominate the research landscape of learning from graphs, due to the efficiency of their adaptive message-passing scheme between nodes. However, DGNs are typically limited in their ability to propagate and preserve long-term dependencies between nodes, i.e., they suffer from the over-squashing phenomena. This reduces their effectiveness, since predictive problems may require to capture interactions at different, and possibly large, radii in order to be effectively solved. In this work, we present Anti-Symmetric DGN (A-DGN), a framework for stable and non-dissipative DGN design, conceived through the lens of ordinary differential equations. We give theoretical proof that our method is stable and non-dissipative, leading to two key results: long-range information between nodes is preserved, and no gradient vanishing or explosion occurs in training. We empirically validate the proposed approach on several graph benchmarks, showing that A-DGN yields to improved performance and enables to learn effectively even when dozens of layers are used.},
note = {Winner of the Best Student Paper Award at DLG-AAAI23},
keywords = {},
pubstate = {published},
tppubtype = {workshop}
}
@conference{Bacciu2023,
title = {Generalizing Downsampling from Regular Data to Graphs},
author = {Davide Bacciu and Alessio Conte and Francesco Landolfi},
url = {https://arxiv.org/abs/2208.03523, Arxiv},
year = {2023},
date = {2023-02-07},
urldate = {2023-02-07},
booktitle = {Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence},
abstract = {Downsampling produces coarsened, multi-resolution representations of data and it is used, for example, to produce lossy compression and visualization of large images, reduce computational costs, and boost deep neural representation learning. Unfortunately, due to their lack of a regular structure, there is still no consensus on how downsampling should apply to graphs and linked data. Indeed reductions in graph data are still needed for the goals described above, but reduction mechanisms do not have the same focus on preserving topological structures and properties, while allowing for resolution-tuning, as is the case in regular data downsampling. In this paper, we take a step in this direction, introducing a unifying interpretation of downsampling in regular and graph data. In particular, we define a graph coarsening mechanism which is a graph-structured counterpart of controllable equispaced coarsening mechanisms in regular data. We prove theoretical guarantees for distortion bounds on path lengths, as well as the ability to preserve key topological properties in the coarsened graphs. We leverage these concepts to define a graph pooling mechanism that we empirically assess in graph classification tasks, providing a greedy algorithm that allows efficient parallel implementation on GPUs, and showing that it compares favorably against pooling methods in literature. },
keywords = {},
pubstate = {published},
tppubtype = {conference}
}
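To convey the intuition of the abstract above, a small networkx sketch of graph downsampling as the counterpart of equispaced decimation: retain a set of pairwise non-adjacent nodes (the graph analogue of keeping every other sample) and reconnect survivors within two hops. This is an illustration only, not the paper's guaranteed-distortion algorithm nor its GPU-parallel greedy implementation; the function name coarsen_once is hypothetical.

    import networkx as nx

    def coarsen_once(G, seed=0):
        # Keep a maximal independent set: retained nodes are pairwise at
        # distance >= 2, mimicking "every other sample" on a path graph.
        kept = set(nx.maximal_independent_set(G, seed=seed))
        H = nx.Graph()
        H.add_nodes_from(kept)
        for u in kept:
            # Reconnect retained nodes that were at most two hops apart,
            # so coarse path lengths roughly track the original ones.
            two_hop = set(w for v in G[u] for w in G[v])
            H.add_edges_from((u, w) for w in two_hop & kept if w != u)
        return H

On a path graph of 2n nodes this returns, up to random tie-breaking, a path of roughly n nodes, matching ordinary signal decimation.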
@article{Bacciu2023b,
title = {Deep Graph Networks for Drug Repurposing with Multi-Protein Targets},
author = {Davide Bacciu and Federico Errica and Alessio Gravina and Lorenzo Madeddu and Marco Podda and Giovanni Stilo},
doi = {10.1109/TETC.2023.3238963},
year = {2023},
date = {2023-02-01},
urldate = {2023-02-01},
journal = {IEEE Transactions on Emerging Topics in Computing},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
@article{Lanciano2023extending,
title = {Extending OpenStack Monasca for Predictive Elasticity Control},
author = {Giacomo Lanciano and Filippo Galli and Tommaso Cucinotta and Davide Bacciu and Andrea Passarella},
doi = {10.26599/BDMA.2023.9020014},
year = {2023},
date = {2023-01-01},
urldate = {2023-01-01},
journal = {Big Data Mining and Analytics},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
@article{DECARO2023126638,
title = {Continual adaptation of federated reservoirs in pervasive environments},
author = {Valerio De Caro and Claudio Gallicchio and Davide Bacciu},
url = {https://www.sciencedirect.com/science/article/pii/S0925231223007610},
doi = {10.1016/j.neucom.2023.126638},
issn = {0925-2312},
year = {2023},
date = {2023-01-01},
urldate = {2023-01-01},
journal = {Neurocomputing},
pages = {126638},
abstract = {When performing learning tasks in pervasive environments, the main challenge arises from the need to combine federated and continual settings. The former comes from the massive distribution of devices with privacy-regulated data. The latter is required by the low resources of the participating devices, which may retain data for short periods of time. In this paper, we propose a setup for learning with Echo State Networks (ESNs) in pervasive environments. Our proposal focuses on the use of Intrinsic Plasticity (IP), a gradient-based method for adapting the reservoir’s non-linearity. First, we extend the objective function of IP to include the uncertainty arising from the distribution of the data over space and time. Then, we propose Federated Intrinsic Plasticity (FedIP), which is intended for client–server federated topologies with stationary data, and adapts the learning scheme provided by Federated Averaging (FedAvg) to include the learning rule of IP. Finally, we further extend this algorithm to Federated Continual Intrinsic Plasticity (FedCLIP), which equips clients with CL strategies for dealing with continuous data streams. We evaluate our approach on an incremental setup built upon real-world datasets from human monitoring, where we tune the complexity of the scenario in terms of the distribution of the data over space and time. Results show that both our algorithms improve the representation capabilities and the performance of the ESN, while being robust to catastrophic forgetting.},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
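A minimal numpy sketch of the two ingredients the abstract above combines: a Triesch-style intrinsic-plasticity update of the reservoir non-linearity, and server-side Federated Averaging of the resulting gains and biases. Function names and hyperparameters are illustrative assumptions; the published FedIP/FedCLIP objective, which also accounts for the distribution of data over space and time, is omitted here.

    import numpy as np

    def ip_step(a, b, x, eta=1e-3, mu=0.0, sigma=0.2):
        # One intrinsic-plasticity step for tanh units: nudge gain a and
        # bias b so that y = tanh(a*x + b) approaches N(mu, sigma^2).
        y = np.tanh(a * x + b)
        db = -eta * (-mu / sigma**2
                     + (y / sigma**2) * (2 * sigma**2 + 1 - y**2 + mu * y))
        da = eta / a + db * x
        return a + da, b + db

    def fedavg(client_params, client_sizes):
        # Server aggregation: average clients' gain/bias arrays,
        # weighted by their local sample counts, as in FedAvg.
        w = np.asarray(client_sizes, dtype=float)
        w /= w.sum()
        return sum(wi * p for wi, p in zip(w, client_params))

A round would then run ip_step on each client's local stream, collect the updated gain/bias arrays, and broadcast fedavg(...) back to the clients.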
@article{10239346,
title = {A 2-phase Strategy For Intelligent Cloud Operations},
author = {Giacomo Lanciano and Remo Andreoli and Tommaso Cucinotta and Davide Bacciu and Andrea Passarella},
doi = {10.1109/ACCESS.2023.3312218},
year = {2023},
date = {2023-01-01},
urldate = {2023-01-01},
journal = {IEEE Access},
pages = {1-1},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
2022
@conference{Caro2022,
title = {Federated Adaptation of Reservoirs via Intrinsic Plasticity},
author = {Valerio {De Caro} and Claudio Gallicchio and Davide Bacciu},
editor = {Michel Verleysen},
url = {https://arxiv.org/abs/2206.11087, Arxiv},
year = {2022},
date = {2022-10-05},
urldate = {2022-10-05},
booktitle = {Proceedings of the 30th European Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning (ESANN 2022)},
abstract = {We propose a novel algorithm for performing federated learning with Echo State Networks (ESNs) in a client-server scenario. In particular, our proposal focuses on the adaptation of reservoirs by combining Intrinsic Plasticity with Federated Averaging. The former is a gradient-based method for adapting the reservoir's non-linearity in a local and unsupervised manner, while the latter provides the framework for learning in the federated scenario. We evaluate our approach on real-world datasets from human monitoring, in comparison with the previous approach for federated ESNs in the literature. Results show that adapting the reservoir with our algorithm provides a significant improvement in the performance of the global model.},
keywords = {},
pubstate = {published},
tppubtype = {conference}
}
@conference{Bacciu2022dlg,
title = {Deep Learning for Graphs},
author = {Davide Bacciu and Federico Errica and Nicolò Navarin and Luca Pasa and Daniele Zambon},
editor = {Michel Verleysen},
year = {2022},
date = {2022-10-05},
urldate = {2022-10-05},
booktitle = {Proceedings of the 30th European Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning (ESANN 2022)},
keywords = {},
pubstate = {published},
tppubtype = {conference}
}
@conference{Valenti2022c,
title = {Modular Representations for Weak Disentanglement},
author = {Andrea Valenti and Davide Bacciu},
editor = {Michel Verleysen},
url = {https://arxiv.org/pdf/2209.05336.pdf},
year = {2022},
date = {2022-10-05},
urldate = {2022-10-05},
booktitle = {Proceedings of the 30th European Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning (ESANN 2022)},
abstract = {The recently introduced weakly disentangled representations relax some constraints of previous definitions of disentanglement in exchange for more flexibility. However, at the moment, weak disentanglement can only be achieved by increasing the amount of supervision as the number of factors of variation of the data increases. In this paper, we introduce modular representations for weak disentanglement, a novel method that keeps the amount of supervised information constant with respect to the number of generative factors. The experiments show that models using modular representations can improve their performance with respect to previous work without the need for additional supervision.},
keywords = {},
pubstate = {published},
tppubtype = {conference}
}