Skip to main content

This website only uses technically necessary cookies. They will be deleted at the latest when you close your browser. To learn more, please read our Privacy Policy.

DE EN
Login
Logo, to home
  1. You are here:
  2. Experimental Data for the Dissertation "Leveraging Constraints for User-Centric Feature Selection"
...

    Dataset: Experimental Data for the Dissertation "Leveraging Constraints for User-Centric Feature Selection"

    • RADAR Metadata
    • Content
    • Statistics
    • Technical Metadata
    Alternate identifier:
    -
    Related identifier:
    (Is Identical To) https://publikationen.bibliothek.kit.edu/1000175819 - URL
    Creator/Author:
    Bach, Jakob https://orcid.org/0000-0003-0301-2798 [Institut für Programmstrukturen und Datenorganisation (IPD), Karlsruher Institut für Technologie (KIT)]
    Contributors:
    -
    Title:
    Experimental Data for the Dissertation "Leveraging Constraints for User-Centric Feature Selection"
    Additional titles:
    -
    Description:
    (Abstract) These are the experimental data for the dissertation> Bach, Jakob. "Leveraging Constraints for User-Centric Feature Selection" at the [Department of Informatics](https://www.informatik.kit.edu/english/index.php) of the [Karlsruhe Institute of Technology](https://www.kit.edu/english/). You can find t... These are the experimental data for the dissertation> Bach, Jakob. "Leveraging Constraints for User-Centric Feature Selection" at the [Department of Informatics](https://www.informatik.kit.edu/english/index.php) of the [Karlsruhe Institute of Technology](https://www.kit.edu/english/). You can find the dissertation [here](https://doi.org/10.5445/IR/1000178649). See the `README` for details. Many input datasets (which we also provide here) either - originate from [OpenML](https://www.openml.org) and are CC-BY-licensed or - originate from [PMLB](https://epistasislab.github.io/pmlb/) and are MIT-licensed. Please see the `LICENSE` files in the corresponding `datasets/` subfolders for details.

    These are the experimental data for the dissertation> Bach, Jakob. "Leveraging Constraints for User-Centric Feature Selection" at the Department of Informatics of the Karlsruhe Institute of Technology. You can find the dissertation here. See the README for details. Many input datasets (which we also provide here) either

    • originate from OpenML and are CC-BY-licensed or
    • originate from PMLB and are MIT-licensed. Please see the LICENSE files in the corresponding datasets/ subfolders for details.
    Show all Show markdown

    (Technical Remarks) # Experimental Data for the Dissertation "Leveraging Constraints for User-Centric Feature Selection" These are the experimental data for the dissertation> Bach, Jakob. "Leveraging Constraints for User-Centric Feature Selection" at the [Department of Informatics](https://www.informatik.kit.edu/englis... # Experimental Data for the Dissertation "Leveraging Constraints for User-Centric Feature Selection" These are the experimental data for the dissertation> Bach, Jakob. "Leveraging Constraints for User-Centric Feature Selection" at the [Department of Informatics](https://www.informatik.kit.edu/english/index.php) of the [Karlsruhe Institute of Technology](https://www.kit.edu/english/). The subfolders correspond to individual chapters of the dissertation: - `chap4-syn`: Chapter 4 - "Evaluating the Impact of Constraints on Feature-Selection Results" - `chap5-ms`: Chapter 5 - "Formulating Scientific Hypotheses as Constraints - A Case Study" - `chap6-afs`: Chapter 6 - "Finding Alternative Feature Sets" - `chap7-csd`: Chapter 7 - "Discovering Sparse and Alternative Subgroup Descriptions" See the corresponding `README` files in the subfolders for more information. We already published prior versions of the experimental data, as the dissertation bases on prior papers: - Chapters 4 and 5: [Data](https://doi.org/10.35097/1345) for the [paper](https://doi.org/10.1007/s42979-022-01338-z) "An Empirical Evaluation of Constrained Feature Selection" - Chapter 6: [Data](https://doi.org/10.35097/1920) for the [paper](https://doi.org/10.48550/arXiv.2307.11607) "Finding Optimal Diverse Feature Sets with Alternative Feature Selection" (Version 2) - Chapter 7: [Data](https://doi.org/10.35097/caKKJCtoKqgxyvqG) for the [paper](https://doi.org/10.48550/arXiv.2406.01411) "Using Constraints to Discover Sparse and Alternative Subgroup Descriptions" (Version 1) For Chapters 4, 5, and 7, we mainly consolidate the existing data. In particular, all `*.csv` files (datasets and results) remain unchanged compared to the data linked above. For Chapter 6, we reran the experimental pipeline to integrate a change for the feature-selection method "Greedy Wrapper". The other feature-selection methods have not changed, but experimental data may slightly differ regarding runtimes and for results affected by solver timeouts. For all four chapters, the following files (in each subfolder) differ from prior versions: - `Evaluation_console_output.txt`: The dissertation's evaluation partly differs from the papers' evaluations (e.g., some analyses added, adapted, or removed). - `README.md`: We adapted these files to the context of the dissertation, added some explanations, and proofread them.

    Experimental Data for the Dissertation "Leveraging Constraints for User-Centric Feature Selection"

    These are the experimental data for the dissertation> Bach, Jakob. "Leveraging Constraints for User-Centric Feature Selection" at the Department of Informatics of the Karlsruhe Institute of Technology. The subfolders correspond to individual chapters of the dissertation:

    • chap4-syn: Chapter 4 - "Evaluating the Impact of Constraints on Feature-Selection Results"
    • chap5-ms: Chapter 5 - "Formulating Scientific Hypotheses as Constraints - A Case Study"
    • chap6-afs: Chapter 6 - "Finding Alternative Feature Sets"
    • chap7-csd: Chapter 7 - "Discovering Sparse and Alternative Subgroup Descriptions" See the corresponding README files in the subfolders for more information. We already published prior versions of the experimental data, as the dissertation bases on prior papers:
    • Chapters 4 and 5: Data for the paper "An Empirical Evaluation of Constrained Feature Selection"
    • Chapter 6: Data for the paper "Finding Optimal Diverse Feature Sets with Alternative Feature Selection" (Version 2)
    • Chapter 7: Data for the paper "Using Constraints to Discover Sparse and Alternative Subgroup Descriptions" (Version 1) For Chapters 4, 5, and 7, we mainly consolidate the existing data. In particular, all *.csv files (datasets and results) remain unchanged compared to the data linked above. For Chapter 6, we reran the experimental pipeline to integrate a change for the feature-selection method "Greedy Wrapper". The other feature-selection methods have not changed, but experimental data may slightly differ regarding runtimes and for results affected by solver timeouts. For all four chapters, the following files (in each subfolder) differ from prior versions:
    • Evaluation_console_output.txt: The dissertation's evaluation partly differs from the papers' evaluations (e.g., some analyses added, adapted, or removed).
    • README.md: We adapted these files to the context of the dissertation, added some explanations, and proofread them.
    Show all Show markdown
    Keywords:
    feature selection
    subgroup discovery
    constraints
    alternatives
    explainability
    interpretability
    XAI
    Related information:
    -
    Language:
    -
    Publishers:
    Karlsruhe Institute of Technology
    Production year:
    2024
    Subject areas:
    Computer Science
    Resource type:
    Dataset
    Data source:
    -
    Software used:
    -
    Data processing:
    -
    Publication year:
    2024
    Rights holders:
    Bach, Jakob https://orcid.org/0000-0003-0301-2798
    Funding:
    -
    Show all Show less
    Name Storage Metadata Upload Action
    Status:
    Published
    Uploaded by:
    kitopen
    Created on:
    2024-11-03
    Archiving date:
    2024-11-08
    Archive size:
    307.1 MB
    Archive creator:
    kitopen
    Archive checksum:
    29870ce49cee60860560c52513b31ac8 (MD5)
    Embargo period:
    -
    The metadata was corrected retroactively. The original metadata will be available after download of the dataset.
    dataset/Experimental Data for the Dissertation "Leveraging Constraints for User-Centric Feature Selection"
    DOI: 10.35097/4kjyeg0z2bxmr6eh
    Publication date: 2024-11-08
    Download Dataset
    Download (307.1 MB)

    Download Metadata
    Statistics
    0
    Views
    0
    Downloads
    Rights statement for the dataset
    This work is licensed under
    CC BY 4.0
    CC icon
    Cite Dataset
    Bach, Jakob (2024): Experimental Data for the Dissertation "Leveraging Constraints for User-Centric Feature Selection". Karlsruhe Institute of Technology. DOI: 10.35097/4kjyeg0z2bxmr6eh
    • About the Repository
    • Privacy Policy
    • Terms and Conditions
    • Legal Notices
    • Accessibility Declaration
    powered by RADAR
    1.22.9 (f) / 1.16.2 (b) / 1.22.4 (i)

    RADAR4KIT ist ein über das Internet nutzbarer Dienst für die Archivierung und Publikation von Forschungsdaten aus abgeschlossenen wissenschaftlichen Studien und Projekten für Forschende des KIT. Betreiber ist das Karlsruher Institut für Technologie (KIT). RADAR4KIT setzt auf dem von FIZ Karlsruhe angebotenen Dienst RADAR auf. Die Speicherung der Daten findet ausschließlich auf IT-Infrastruktur des KIT am Steinbuch Centre for Computing (SCC) statt.

    Eine inhaltliche Bewertung und Qualitätsprüfung findet ausschließlich durch die Datengeberinnen und Datengeber statt.

    1. Das Nutzungsverhältnis zwischen Ihnen („Datennutzerin“ bzw. „Datennutzer“) und dem KIT erschöpft sich im Download von Datenpaketen oder Metadaten. Das KIT behält sich vor, die Nutzung von RADAR4KIT einzuschränken oder den Dienst ganz einzustellen.
    2. Sofern Sie sich als Datennutzerin oder als Datennutzer registrieren lassen bzw. über Shibboleth legitimieren, kann Ihnen seitens der Datengeberin oder des Datengebers Zugriff auch auf unveröffentlichte Dokumente gewährt werden.
    3. Den Schutz Ihrer persönlichen Daten erklären die Datenschutzbestimmungen.
    4. Das KIT übernimmt für Richtigkeit, Aktualität und Zuverlässigkeit der bereitgestellten Inhalte keine Gewährleistung und Haftung, außer im Fall einer zwingenden gesetzlichen Haftung.
    5. Das KIT stellt Ihnen als Datennutzerin oder als Datennutzer für das Recherchieren in RADAR4KIT und für das Herunterladen von Datenpaketen keine Kosten in Rechnung.
    6. Sie müssen die mit dem Datenpaket verbundenen Lizenzregelungen einhalten.