3
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Tisane: Authoring Statistical Models via Formal Reasoning from Conceptual and Data Relationships

      Preprint
      , , ,

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Proper statistical modeling incorporates domain theory about how concepts relate and details of how data were measured. However, data analysts currently lack tool support for recording and reasoning about domain assumptions, data collection, and modeling choices in an integrated manner, leading to mistakes that can compromise scientific validity. For instance, generalized linear mixed-effects models (GLMMs) help answer complex research questions, but omitting random effects impairs the generalizability of results. To address this need, we present Tisane, a mixed-initiative system for authoring generalized linear models with and without mixed-effects. Tisane introduces a study design specification language for expressing and asking questions about relationships between variables. Tisane contributes an interactive compilation process that represents relationships in a graph, infers candidate statistical models, and asks follow-up questions to disambiguate user queries to construct a valid model. In case studies with three researchers, we find that Tisane helps them focus on their goals and assumptions while avoiding past mistakes.

          Related collections

          Author and article information

          Journal
          07 January 2022
          Article
          10.1145/3491102.3501888
          2201.02705
          7edce39b-823b-42fc-9177-b667e8604de8

          http://creativecommons.org/licenses/by-nc-sa/4.0/

          History
          Custom metadata
          cs.AI cs.HC cs.PL stat.CO stat.OT

          Programming languages,General statistics,Artificial intelligence,Human-computer-interaction,Mathematical modeling & Computation

          Comments

          Comment on this article