BPI cluster meeting 4July’18

This week Guillaume Crognier will give a talk during our BPI cluster meeting. Timing is as usual  between 12:30-13:30 in the room Pav. K.16. Guillaume will present the outcome of his work since he joined our group.


Constructing decision trees by using Column Generation with restricted parameters


Almost every machine learning algorithm to generate decision trees is greedy, and may be far from the « optimal » decision tree. Research papers have already tried to model this problem as a MILP (mixed integer linear program), but most of them are too slow to be used in practice or cannot deal with big datasets. The purpose of this work is to show that such optimal algorithms can be greatly improved (considering the quality of the tree as well as the computational time) by using columns generation.

You are warmly invited to Guillaume’s talk.

BPI cluster meeting 20June’18

This week we will have a guest presenter,  dr. S. Faghih Roohi from OPAC group of our faculty. The place of the presentation is Pav.K1.6 and time is between 12:30-13:30.


A group decision making approach for risk ranking and lane selection in distribution of pharmaceutical products


This study aims to provide a group decision making framework based on the prioritized risks for selecting shipment lanes of pharmaceutical products. The risks involved in the decision making are identified and categorized by referring into the conventional failure modes and effect analysis (FMEA) tables. Using categorized risks, a new shortened FMEA table is proposed for evaluation by a group of experts in pharmaceutical distribution and logistics industry. The evaluations by experts are primarily in linguistic terms which are further converted to intuitionistic fuzzy numbers (IFNs) for aggregation operations. By using an intuitionistic fuzzy hybrid TOPSIS (Technique for Order Preference by Similarity to Ideal Solution) approach, the risks in shipment processes for every lane are scored and prioritized. The relative closeness coefficients of risk categories resulted from intuitionistic fuzzy hybrid TOPSIS are used for lane comparison and selection. Lane selection is performed in multiple rounds in a way that in each round, the higher-scored lanes are selected based on the lower-ranked (higher-priority) risks. The proposed approach provides an opportunity for all managers and decision makers to evaluate risks and to keep/establish current/new lanes. Finally, a case study of lane selection on air cargo distribution of pharmaceutical products is presented to demonstrate the potential applications of the proposed approach.

You are warmly invited.

BPI cluster meeting 30May’18

This week we will have a guest presenter, Gabriele Modena from ImproveDigital company. The place of the presentation is Pav.K1.6 and time is between 12:30-13:30.


Machine Learning Methods in Adtech

Short Abstract:

Machine Learning systems are used in adtech to drive decision making, revenue, and personalise the user’s online experience. ML is used to answer questions such as is the visitor a human or a bot? What creative should be displayed in order to maximise the probability of a click? What is the optimal reserve price for an impression? How many clicks will a new ad-placement system get?

Typically we need to answer these questions several tens of thousands of times per second, under soft real-time constraints.

In this presentation we’ll give an overview of use cases in the industry, and how machine learning is used at scale in the Improve Digital platform.

 About the company:

Improve Digital has the All-in-One Advertising Platform for Publishers, Content Providers and Broadcasters. Improve Digital annouces its mission as building smart, efficient, and responsible digital businesses for its enterprise customers. It creates the technology that makes advertising marketplaces possible.

You are warmly invited.

BPI cluster meeting 23May’18

This week we will have a guest presenter, dr. Kalliopi Zervanou, is currently a lecturer in Information and Computing Sciences in Utrecht University. The place of the presentation is Pav.K1.6 and time is between 12:30-13:30.


Linking multi-disciplinary data sources in the Time Capsule system



Time Capsule is a historical research system for botanical remedies from the New World in the early modern period (17-18th century): the historical evolution of economic importance, ethical attitudes, scientific interests, trade and knowledge circulation.


Historical data is scattered across collections developed for various domains and purposes. Its amount and complexity raises the need for a presentation allowing exploration and detailed inspection. Finally, the problem of information validation and sharing must be addressed. In Time Capsule we have i) integrated and linked multidisciplinary data sources and ii) developed an online research platform that supports data access, presentation, validation and sharing.


In our approach, data source integration entails concept mapping, not only across disciplines, but also in time. Thus, it calls for support for the scientific evolution from the 16th century onwards in re-classifying and re-defining concepts. Additionally, it entails dealing with phenomena of historical term variation and ambiguity which gradually give way to spelling standardisation and current nomenclature conventions in e.g. botany and biology. Furthermore, it requires addressing under-specificity and ambiguity of information found in historical sources while maintaining associations with potentially related concepts and context. Most importantly, it requires providing references for information provenance tracing and validation.

You are warmly invited.





BPI cluster meeting 2May’18

Our cluster meeting on 2nd May will be in Pav. K.16 between 12:30-13:30. This week we will have a guest lecturer Maurits Kaptein

The title and the abstract of Maurits Kaptein’s lecture are as follows:


Personalization and bandits with applications in health.



In this talk prof. Maurits Kaptein will talk about his work in the computational personalization lab at JADS (Den Bosch). Starting from a (contextual) multi-armed bandit formalization of treatment personalization Maurits will present his work on developing novel multi-armed bandit policies (e.g., bootstrapped thompson sampling), on software development to evaluate multi-armed bandit policies in the field (https://github.com/Nth-iteration-labs/streamingbandit), and on the use of this framework for personalization in (e)Health and online marketing. The talk will provide a broad overview of the work carried out by Maurits and his PhD students over the last five years.


Short Bio of Maurits Kaptein:

He received his Ph.D. with honors form the Eindhoven University of Technology, Eindhoven, the Netherlands. Next, he worked as a postdoctoral researcher at the Aalto school of Economics, Aalto, Finland. Afterwards he worked for 2 years as an assistant professor of Statistics and Research Methods at the University of Tilburg. He has previously (during his Ph.D. work) worked as a research scientist at Philips Research, Eindhoven, the Netherlands and as a distinguished visiting scholar at the CHIMe lab of Stanford University, Stanford CA, USA. He has also worked as an assistant professor in Artificial Intelligence (AI) at the Radboud University Nijmegen where he was the track leader of a master track called “Web and Language”.

BPI cluster meeting 25Apr’18

Our BPI cluster meeting on 25th  April will be a joint meeting with Data Science Center Eindhoven (DSC/e).

The DCS/e lecture will be given by Marie-Jeanne Lesot. She is an associate professor in the department of Computer Science Lab of Paris 6 (LIP6) and a member of the Learning and Fuzzy Intelligent systems (LFI) group.

Where: Kennispoort (Grote Zaal), J.F. Kennedylaan 2, Eindhoven.

When: 25th  April, as usual, between 12:30-13:30. (doors open at 12:00).

The title and the abstract of Marie-Jeanne Lesot’s lecture is as follows:


Extracting knowledge in linguistic form


Machine learning can be seen as aiming to allow users to understand the huge quantities of data they are faced with. One way to facilitate interpretation of the results consists in presenting them in natural language, offering linguistic expressions which may be easier to understand. The choice of such result formulation then has an impact on the machine learning techniques to be applied to the data. This talk will present three tasks in this framework, considering different types of data.

The first task aims at extracting gradual itemsets from numerical data, as well as contextual variants thereof, linguistically expressing information about the feature covariations, as illustrated by the example “the higher the speed, the greater the danger”. A second task aims at summarising temporal series, in particular their periodicity, using the specific quantifier “regularly”. In both cases, the question is to precisely define the associated semantics and to define efficient extraction algorithms.  A third task investigates the measure of the relevance of the linguistic terms used to express the summaries, both with respect to the data structure, in case of linguistic variables, and with respect to the cognitive interpretation, in case of approximate numerical expressions.

Short bio:

Marie-Jeanne Lesot obtained her PhD in 2005 from the University Pierre et Marie Curie in Paris. Since 2006 she has been an associate professor in the department of Computer Science Lab of Paris 6 (LIP6) and a member of the Learning and Fuzzy Intelligent systems (LFI) group. Her research interests focus on fuzzy machine learning with an objective of data interpretation and semantics integration and, in particular, to model and manage subjective information; they include similarity measures, fuzzy clustering, linguistic summaries, affective computing and information scoring.


BPI cluster meeting 4Apr’18

Our BPI cluster meeting on 4th  April will be in Pav. K.16 between 12:30-13:30.

During that session, Joao Paulo Carvalho will be our guest lecturer. Joao is from Instituto Superior Técnico, University of Lisbon, Portugal. He is nowadays on his sabbatical in our group. For detailed information about Joao Paulo Carvalho you can check the following link  https://www.l2f.inesc-id.pt/w/João_Paulo_Carvalho .

The title and the abstract of Joao’s talk is as follows:

Fuzzy Fingerprints: Identification and classification based on top-k values


Fuzzy Fingerprints (FFP) were developed as a technique to allow the identification of an individual out of a large number of suspects based on their usage habits. They were inspired by the fact that many types of data studied in the physical and social sciences can be approximated with a Zipfian distribution, where the frequency of an item is inversely proportional to its rank in the frequency table. Fuzzy Fingerprints efficiently use the implicit information contained in top-K most frequent data values to perform identification in large datasets. The term “fingerprint” is used in the sense that fingerprints are unique, and are usually left unintentionally, allowing us to identify their “owners”.

“Identification” can be seen as a specific classification task where the number of classes is unusually large. Despite being originally used as a “user identification” technique, FFP have been extended to identify and classify from single users to categories, topics or classes, and have shown to be competitive with machine learning techniques even when dealing with a small number of classes ,while exhibiting some interesting properties.

In this talk I will approach the ideas behind Fuzzy Fingerprints and show case studies and applications involving: identification of anonymous users based on their phone and web usage habits; text author identification based on their writing habits; classification and identification in social data (e.g. detecting tweets related to a given trending topic); classification based on medical text data; movie recommendation; etc.

BPI cluster meeting 28Mar’18

Our BPI cluster meeting on 28th  March will be in Pav. K.16 between 12:30-13:30.

During that session, one PhD student Jason Rhuggenaath will talk about his current studies.

The promotor of Jason is Prof. Uzay Kaymak, and co-promoters are dr. Yingqian Zhang and dr. Alp Akcay.


Please find the title and the abstract below:



Fuzzy decision trees



A popular method in machine learning for supervised classification is decision trees. In this work we propose a new framework to learn fuzzy decision trees using mathematical programming. More specifically, we encode the problem of constructing fuzzy decision trees using a Mixed Integer Linear Programming (MIP) model, which can be solved by any optimization solver.

We compare the performance of our method with the performance of off-the-shelf decision tree algorithm CART and Fuzzy Inference Systems (FIS) using benchmark data-sets. Our initial results are promising and show the advantages of using non-crisp boundaries for improving classification accuracy on testing data.


BPI cluster meeting 28Feb’18

The speaker is Joost van Twist. He obtained his MSc. Degree in the Eindhoven University of Technology . He worked in companies that are major players in their markets like Quintiq and Philips. Now he works at Viggo as a software engineer implementing planning and scheduling algorithms for Eindhoven airport.



Applications of operations research and data science at Viggo



At Viggo we are responsible for the ground handling at Eindhoven airport. In our organization of more than 400 employees and with an airfield that is continuously growing, there are many challenging reallife planning puzzles such as: Assigning parking stands and gates to planes, the scheduling of employees, and logistic puzzles for ground equipment and luggage. On top of that,  as the operations are being more and more digitalised, we have access to wide variety of data, that gives oppurtunities for doing data anlaysis. For example, being able to predict when steps in the operations will cause a delay in the flight. A lot of software is developed in-house which is unique for ground handling companies and this software is also used to perform various consultancy services.

BPI cluster meeting 21Feb’18

Our BPI cluster meeting on 21st February will be a joint session with a DSC/e lecture by Marek Reformat.

Title: Fuzziness in Processing and Representation of Web Data


The web represents an immense repository of information. A number of sources of structured and unstructured data is growing every day. There is no doubt that our dependency on web data increases continuously. However, the increased amount of data – although recognized as a positive and beneficial fact – creates challenges regarding our ability to fully utilize that data. Such situation increases pressure as well as expectations for providing better ways of processing data available on the web.

Every day, users search the web for things of their interest. On multiple occasions they expect precise results. However, human’s curiosity and a need for being exposed to different and novel things is an important part of exploration processes. Existing systems supporting users in

 their search activities provide them with some variations, but it is not a controlled process. Diversity is accidental. In the first part of the presentation, we postulate that application of fuzziness in systems supporting users in their search will allow them to guide and control mechanisms that identify alternatives, and influence recommendations. Fuzzy-based methods can be applied to scenarios where users want to relax their requirements. Here, we concentrate on social networks. A methodology for selecting groups of individuals that satisfy linguistically described requirements regarding a degree of matching between users’ interests and collective interests of groups is presented. Additionally, we describe a simple fuzzy-based recommending approach that aims at constructing lists of suggested items. This is accomplished via explicit control of requirements regarding rigorousness of identifying users who become a reference base for generated suggestions.

A novel graph-based data representation format becomes an attractive and important way of storing data. It leads to better utilization of information stored and available on the web. High connectedness of such representation provides a means to create methods and techniques that can assimilate new data and build knowledge-like data structures. Such procedures resemble a human-like way of dealing with information. One of the most popular graph-based data formats is called the Resource Description Framework (RDF). It is a data format introduced together with the concept of Semantic Web. In the second part of the presentation, we present a process of assimilating information from multiple sources of RDF data. A newly proposed form of participatory learning using propositions provides an approximate reasoning-based approach to integrate previously unknown information with already known facts. We show how participatory learning has been adapted to integrating new information represented as relations. The approach recognizes two types of variables: conjunctive and disjunctive, that are common for knowledge graphs existing on the web.

The details of the above methodologies are presented, and multiple examples illustrating behavior of the processes are provided.


Marek Reformat (IEEE SM’05) received the M.Sc. degree (Hons.) from the Technical University of Poznan, Poznan, Poland, and the Ph.D. degree from the University of Manitoba, Winnipeg, MB, Canada. He is currently a Professor with the Department of Electrical and Computer Engineering, University of Alberta, Edmonton, AB, Canada. The goal of his research activities is to develop methods and techniques for intelligent data modeling and analysis leading to translation of data into knowledge, as well as to design systems that possess abilities to imitate different aspects of human behavior. In this context, the concepts of computational intelligence—with fuzzy computing and possibility theory in particular—are key elements necessary for capturing relationships between pieces of data and knowledge, and for mimicking human ways of reasoning about opinions and facts. He also works on computational intelligence-based approaches for dealing with information stored on the web. He applies elements of fuzzy sets to social networks, linked data, and Semantic Web in order to handle inherently imprecise information, and provide users with unique facts retrieved from the data. All his activities focus on introduction of human aspects to web and software systems which will lead to the development of more human-aware and human-like systems.

You are all kindly invited to this joint session!

Date:21 February

Time:12:30 – 13:30

Remarks:Doors open at 12:00h

Location:TU/e Luna building, Corona room (Koepelzaal)

Registration Link