Skip to content

feat: include outputFacets and inputFacets in Dataset API responses#3106

Open
psaikaushik wants to merge 1 commit intoMarquezProject:mainfrom
psaikaushik:feat/1746-include-output-input-facets-v2
Open

feat: include outputFacets and inputFacets in Dataset API responses#3106
psaikaushik wants to merge 1 commit intoMarquezProject:mainfrom
psaikaushik:feat/1746-include-output-input-facets-v2

Conversation

@psaikaushik
Copy link
Copy Markdown

@psaikaushik psaikaushik commented Apr 15, 2026

Summary

Updates the DatasetDao and DatasetVersionDao SQL queries to include facets of type 'output' in API responses, in addition to the existing 'dataset', 'unknown', and 'input' types.

Supersedes #3103 (recreated to fix DCO sign-off).

Closes #1746

Problem

Marquez stores outputFacets and inputFacets correctly in the dataset_facets table. However, the read queries filtered with:

(df.type ILIKE 'dataset' OR df.type ILIKE 'unknown' OR df.type ILIKE 'input')

This excluded 'output' type facets (e.g., OutputStatistics) from API responses.

Fix

Added OR df.type ILIKE 'output' to the facet type filter in:

  • DatasetDao.findDatasetByName
  • DatasetDao.findAll
  • DatasetVersionDao.findByUuid
  • DatasetVersionDao.findAll

Checklist

  • Minimal change (4 SQL query updates)
  • No schema changes
  • DCO sign-off included

Add 'output' type to the facet type filter in DatasetDao and
DatasetVersionDao SQL queries. Previously, facets of type 'output'
(e.g. OutputStatistics, DataQualityMetrics) were stored correctly
but excluded from read queries.

Affected queries:
- DatasetDao.findDatasetByName
- DatasetDao.findAll
- DatasetVersionDao.findByUuid
- DatasetVersionDao.findAll

Closes MarquezProject#1746

Signed-off-by: Sai Kaushik Ponnekanti <psaikaushik@gmail.com>
@psaikaushik
Copy link
Copy Markdown
Author

@wslulciuc , @davidjgoss : Can you please take a look and let me know. thanks!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

api API layer changes

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Marquez Dataset APIs don't return outputFacets or inputFacets

1 participant