Skip to content

Releases: apache/texera

Apache Texera(Incubating) Release v1.1.0 RC4

06 Mar 00:08
ac909a0

Choose a tag to compare

What's Changed

  • Fix unaligned representation of python objects in BINARY format by @shengquan-ni in #3347
  • Add the computing unit managing service and k8s architecture by @bobbai00 in #3337
  • Fix streaming issue for single CSV export by @aicam in #3350
  • Move export result resource to computing unit by @aicam in #3351
  • Update the About page by @bobbai00 in #3353
  • Cap the number of displayed console messages at GUI and the max length of each message by @kunwp1 in #3346
  • Admins can make comments on users by @paulschatt in #3266
  • Add Dendrogram Visualization Operator by @colinthebomb1 in #3352
  • Avoid overlaps between name editing button and information button in operator property panel by @Ma77Ball in #3349
  • Enhance Helm Charts for Easier GCP Deployment by @shengquan-ni in #3358
  • Fix heap error for exporting zip result by @aicam in #3360
  • Improve Helm chart and jwt secret access logic by @bobbai00 in #3357
  • Add ViolinPlot Option to BoxPlot Operator by @qzt168 in #3356
  • Improve Computing Unit Frontend by @bobbai00 in #3363
  • Fix File Scan Operator Extract Function by @aglinxinyuan in #3367
  • Fix hard-coded values and wrong configurations in Helm by @bobbai00 in #3366
  • Fix the descriptions, variable names, and functions in the BoxViolinPlot operator by @qzt168 in #3365
  • Add GPU support for creating the computing unit by @bobbai00 in #3371
  • Enhance Visualization Grouping with New Category Hierarchy by @xuang7 in #3369
  • Enable dynamic resizing for the workflow name input field in the workspace by @qzt168 in #3359
  • Add slider to adjust JVM memory size when creating the computing unit by @bobbai00 in #3372
  • Ensure Operator Menu Displays All First-Level Categories by @colinthebomb1 in #3364
  • Fix metric-resolution for the metric server specified in the helm chart by @aicam in #3376
  • Fix export in cluster mode by @aicam in #3378
  • Add Bullet Chart Visualization Operator by @xuang7 in #3362
  • Disabling Envoy HTTP Timeout by @kunwp1 in #3380
  • Organize request and response structures by @aicam in #3379
  • Fix formatting issue of envoy config yaml by @shengquan-ni in #3382
  • Fix the issue where the dataset content cannot be displayed on the dataset detail page when the user is not logged in by @GspikeHalo in #3373
  • Add size info for workflow json by @paulschatt in #3256
  • Add new R library to Dockerfile and fix the jobs' order in Helm chart by @bobbai00 in #3386
  • Add admin account in the configuration file and make local-sign-up accounts restricted by @bobbai00 in #3387
  • Bump pyArrow to 19.0.1 by @aglinxinyuan in #3368
  • Create NOTICE file by @aglinxinyuan in #3388
  • Fix the issue where the File Scan operator fails to properly handle compressed files containing multiple files by @GspikeHalo in #3385
  • Enable email notification to the administrator upon an unauthorized or inactive user login attempt by @GspikeHalo in #3391
  • Ensure that after the workspace is opened the operator is displayed in the center by @GspikeHalo in #3344
  • Set image-pull-policy when creating the computing unit by @bobbai00 in #3395
  • Use MultipartUpload to implement the upload-one-file endpoint of the FileService by @aicam in #3396
  • Enable email notifications to users upon an inactive login attempt by @GspikeHalo in #3393
  • Add if statement on catch error for uploadOneFileToDataset by @aicam in #3407
  • Add a flag at the workflow workspace to prevent registering the auto persistence when the workflow is loading by @bobbai00 in #3412
  • Fix Incorrect Version and Execution Order in Backend Queries by @kunwp1 in #3410
  • Fix base image to node:18.17 in the texera-web-application.dockerfile by @bobbai00 in #3413
  • Add ASF Header to JOOQ generated files by @aglinxinyuan in #3414
  • Remove the job waiter in the helm chart and make FileService do forceful healthcheck by @bobbai00 in #3416
  • Add ASF header and RAT CI by @aglinxinyuan in #3415
  • Replace Terminate Button Icon with Close-Circle by @xuang7 in #3424
  • Add RangeSlider visualization operator by @Ma77Ball in #3374
  • Reimplement User Quota Service to remove MongoDB Dependency by @yunyad in #3402
  • Add Nested Table Visualization Operator by @colinthebomb1 in #3397
  • Sanitize Uppercase Characters to Lowercase in Dataset Names by @xuang7 in #3423
  • Add Pre-Check File Size and Type Before File Preview by @xuang7 in #3427
  • Revert Changes to Binary Chunking Due to Side Effects by @kunwp1 in #3431
  • Display Individual Progress Indicators for Concurrent File Uploads by @xuang7 in #3426
  • Add DaemonSet for ComputingUnitMaster image pre-pulling by @bobbai00 in #3419
  • Install necessary packages for building gui in texera web application dockerfile by @bobbai00 in #3433
  • Fix the compilation of Python Lambda Operator by @bobbai00 in #3438
  • Fix ChannelMarker transmission between Java and Python by @aglinxinyuan in #3434
  • Fix ChannelMarker sending on PyAmber by @aglinxinyuan in #3439
  • Implement auto-expansion of the result panel upon operator click by @GspikeHalo in #3435
  • Enhance user experience of comments box by @GspikeHalo in #3436
  • Add Volcano Plot Visualization Operator by @SarahAsad23 in #3418
  • Add share-editing-server and pylsp dockerfiles to the deployment folder by @bobbai00 in #3441
  • Fix ChannelMarker Partitioning logic on PyAmber by @aglinxinyuan in #3445
  • Add HTML format & large Arrow files to result exportation by @aicam in #3429
  • Make operator property sidebar dynamically adjust height based on content by @GspikeHalo in #3437
  • Fix Incorrect Channel ID Inclusion Across Workers in ChannelMarker by @aglinxinyuan in #3448
  • Handle mismatched datasets on displaying dashboard by @xuang7 in #3446
  • Use Input Port Materialization Reader Threads to Replace Cache Source Operators by @Xiao-zhen-Liu in #3425
  • Persist the spec of the computing_unit entity and its relationship with workflow_executions into DB by @bobbai00 in #3400
  • Add HILDA 2025 workshop paper to Readme by @shengquan-ni in #3450
  • Fix issue with workspace defaulting to previous workflow after refresh by @GspikeHalo in #3342
  • Add a new config to control the maximum websocket payload size by @bobbai00 in #3452
  • Enable adjusting shared memory for the computing unit by @bobbai00 in #3451
  • Fix operators showing "Running" state after workflow completion by @shengquan-ni in #3463
  • Add domain display in email content by @GspikeHalo in #3440
  • Fix project startup failure on macOS caused by attempting to retrieve the domain name by @GspikeHalo in #3465
  • Move gui and user-system configurations to core/config and host config-retrieval endpoints via core/config-service by @bobbai00 in #3458
  • Move all conf files and readers to core/config by @bobbai00 in #3466
  • Cleanly Enforce Input Port Dependencies by @Xiao-zhen-Liu in #3460
  • Migrate existing Markers to Control Messages by @aglinxinyuan in #3432
  • Remove Marker Class by @agl...
Read more

Apache Texera(Incubating) Release v1.1.0 RC 3

23 Feb 19:51
ab9a84b

Choose a tag to compare

What's Changed

  • Fix unaligned representation of python objects in BINARY format by @shengquan-ni in #3347
  • Add the computing unit managing service and k8s architecture by @bobbai00 in #3337
  • Fix streaming issue for single CSV export by @aicam in #3350
  • Move export result resource to computing unit by @aicam in #3351
  • Update the About page by @bobbai00 in #3353
  • Cap the number of displayed console messages at GUI and the max length of each message by @kunwp1 in #3346
  • Admins can make comments on users by @paulschatt in #3266
  • Add Dendrogram Visualization Operator by @colinthebomb1 in #3352
  • Avoid overlaps between name editing button and information button in operator property panel by @Ma77Ball in #3349
  • Enhance Helm Charts for Easier GCP Deployment by @shengquan-ni in #3358
  • Fix heap error for exporting zip result by @aicam in #3360
  • Improve Helm chart and jwt secret access logic by @bobbai00 in #3357
  • Add ViolinPlot Option to BoxPlot Operator by @qzt168 in #3356
  • Improve Computing Unit Frontend by @bobbai00 in #3363
  • Fix File Scan Operator Extract Function by @aglinxinyuan in #3367
  • Fix hard-coded values and wrong configurations in Helm by @bobbai00 in #3366
  • Fix the descriptions, variable names, and functions in the BoxViolinPlot operator by @qzt168 in #3365
  • Add GPU support for creating the computing unit by @bobbai00 in #3371
  • Enhance Visualization Grouping with New Category Hierarchy by @xuang7 in #3369
  • Enable dynamic resizing for the workflow name input field in the workspace by @qzt168 in #3359
  • Add slider to adjust JVM memory size when creating the computing unit by @bobbai00 in #3372
  • Ensure Operator Menu Displays All First-Level Categories by @colinthebomb1 in #3364
  • Fix metric-resolution for the metric server specified in the helm chart by @aicam in #3376
  • Fix export in cluster mode by @aicam in #3378
  • Add Bullet Chart Visualization Operator by @xuang7 in #3362
  • Disabling Envoy HTTP Timeout by @kunwp1 in #3380
  • Organize request and response structures by @aicam in #3379
  • Fix formatting issue of envoy config yaml by @shengquan-ni in #3382
  • Fix the issue where the dataset content cannot be displayed on the dataset detail page when the user is not logged in by @GspikeHalo in #3373
  • Add size info for workflow json by @paulschatt in #3256
  • Add new R library to Dockerfile and fix the jobs' order in Helm chart by @bobbai00 in #3386
  • Add admin account in the configuration file and make local-sign-up accounts restricted by @bobbai00 in #3387
  • Bump pyArrow to 19.0.1 by @aglinxinyuan in #3368
  • Create NOTICE file by @aglinxinyuan in #3388
  • Fix the issue where the File Scan operator fails to properly handle compressed files containing multiple files by @GspikeHalo in #3385
  • Enable email notification to the administrator upon an unauthorized or inactive user login attempt by @GspikeHalo in #3391
  • Ensure that after the workspace is opened the operator is displayed in the center by @GspikeHalo in #3344
  • Set image-pull-policy when creating the computing unit by @bobbai00 in #3395
  • Use MultipartUpload to implement the upload-one-file endpoint of the FileService by @aicam in #3396
  • Enable email notifications to users upon an inactive login attempt by @GspikeHalo in #3393
  • Add if statement on catch error for uploadOneFileToDataset by @aicam in #3407
  • Add a flag at the workflow workspace to prevent registering the auto persistence when the workflow is loading by @bobbai00 in #3412
  • Fix Incorrect Version and Execution Order in Backend Queries by @kunwp1 in #3410
  • Fix base image to node:18.17 in the texera-web-application.dockerfile by @bobbai00 in #3413
  • Add ASF Header to JOOQ generated files by @aglinxinyuan in #3414
  • Remove the job waiter in the helm chart and make FileService do forceful healthcheck by @bobbai00 in #3416
  • Add ASF header and RAT CI by @aglinxinyuan in #3415
  • Replace Terminate Button Icon with Close-Circle by @xuang7 in #3424
  • Add RangeSlider visualization operator by @Ma77Ball in #3374
  • Reimplement User Quota Service to remove MongoDB Dependency by @yunyad in #3402
  • Add Nested Table Visualization Operator by @colinthebomb1 in #3397
  • Sanitize Uppercase Characters to Lowercase in Dataset Names by @xuang7 in #3423
  • Add Pre-Check File Size and Type Before File Preview by @xuang7 in #3427
  • Revert Changes to Binary Chunking Due to Side Effects by @kunwp1 in #3431
  • Display Individual Progress Indicators for Concurrent File Uploads by @xuang7 in #3426
  • Add DaemonSet for ComputingUnitMaster image pre-pulling by @bobbai00 in #3419
  • Install necessary packages for building gui in texera web application dockerfile by @bobbai00 in #3433
  • Fix the compilation of Python Lambda Operator by @bobbai00 in #3438
  • Fix ChannelMarker transmission between Java and Python by @aglinxinyuan in #3434
  • Fix ChannelMarker sending on PyAmber by @aglinxinyuan in #3439
  • Implement auto-expansion of the result panel upon operator click by @GspikeHalo in #3435
  • Enhance user experience of comments box by @GspikeHalo in #3436
  • Add Volcano Plot Visualization Operator by @SarahAsad23 in #3418
  • Add share-editing-server and pylsp dockerfiles to the deployment folder by @bobbai00 in #3441
  • Fix ChannelMarker Partitioning logic on PyAmber by @aglinxinyuan in #3445
  • Add HTML format & large Arrow files to result exportation by @aicam in #3429
  • Make operator property sidebar dynamically adjust height based on content by @GspikeHalo in #3437
  • Fix Incorrect Channel ID Inclusion Across Workers in ChannelMarker by @aglinxinyuan in #3448
  • Handle mismatched datasets on displaying dashboard by @xuang7 in #3446
  • Use Input Port Materialization Reader Threads to Replace Cache Source Operators by @Xiao-zhen-Liu in #3425
  • Persist the spec of the computing_unit entity and its relationship with workflow_executions into DB by @bobbai00 in #3400
  • Add HILDA 2025 workshop paper to Readme by @shengquan-ni in #3450
  • Fix issue with workspace defaulting to previous workflow after refresh by @GspikeHalo in #3342
  • Add a new config to control the maximum websocket payload size by @bobbai00 in #3452
  • Enable adjusting shared memory for the computing unit by @bobbai00 in #3451
  • Fix operators showing "Running" state after workflow completion by @shengquan-ni in #3463
  • Add domain display in email content by @GspikeHalo in #3440
  • Fix project startup failure on macOS caused by attempting to retrieve the domain name by @GspikeHalo in #3465
  • Move gui and user-system configurations to core/config and host config-retrieval endpoints via core/config-service by @bobbai00 in #3458
  • Move all conf files and readers to core/config by @bobbai00 in #3466
  • Cleanly Enforce Input Port Dependencies by @Xiao-zhen-Liu in #3460
  • Migrate existing Markers to Control Messages by @aglinxinyuan in #3432
  • Remove Marker Class by @agl...
Read more

Apache Texera(Incubating) Release v1.1.0 RC

10 Feb 19:44
7048ce3

Choose a tag to compare

What's Changed

Read more

Apache Texera(Incubating) Release v1.1.0

23 Apr 22:18
7c35e42

Choose a tag to compare

Release Highlights

1. Feature Upgrades

  • Advanced Computing Unit Feature: We refined the Kubernetes-based computing unit layer by improving the UI/UX and enabling shared-memory control, GPU support and JVM memory tuning (#3337, #3363, #3371, #3372, #3400, #3451, #3591, #3604).
  • Dynamic Admin Configuration System: Site settings are now stored in the database, and admins can customize logos and favicons with a live preview of branding changes (#3453, #3480, #3700, #3743).

2. Operators

  • New Visualization Operators: We added Choropleth Map, Bullet Chart, Dendrogram, Violin Plot, Volcano Plot, Nested Table, Range Slider, and Time Series Plot operators (#3352, #3356, #3362, #3374, #3397, #3418, #3531, #3744).
  • Machine Learning Operators: We introduced a suite of scikit-learn training operators (#3535).
  • Utility Operators: We added a Sleep operator, enhanced the Generator with examples, and improved the Projection operator with column re-ordering (#3537, #3543, #3661).
  • Enhanced Visualization System: We organized operators with a new category hierarchy, improved grouping, and first-level category display (#3364, #3369).

3. Authentication & Security

  • Admin Account Management: Admin accounts can be auto-registered from configuration, and admins can leave comments to users (#3266, #3387, #3625).
  • Security Enhancements: The system tracks user activity with last-login times, separates JWT expiration from activity windows, and sends email alerts for unauthorized access attempts (#3391, #3393, #3541, #3693, #3694, #3703, #3731).
  • Access Control Improvements: We refined JWT secret access logic, optimized token refresh behavior, and added login confirmation requirements (#3357, #3703).

4. Infrastructure & Deployment

  • Apache Foundation Compliance: We added ASF headers, NOTICE and DISCLAIMER files, integrated Apache Rat checks in CI, and updated license headers (#3388, #3414, #3415, #3631, #3632, #3637).
  • Kubernetes and Helm Enhancements: We added GCP deployment support, a DaemonSet for image pre-pulling, ingress class configuration, and improved job ordering (#3358, #3366, #3376, #3386, #3419, #3652).
  • Build System Improvements: We automated jOOQ code generation in SBT, removed jOOQ from CI, and migrated builds to a JDK to support Java UDFs (#3628, #3670, #3689, #3746, #3747).
  • Configuration Management: We centralized configuration in a dedicated service, added environment-variable defaults, and stored site settings in the database (#3458, #3466, #3622).
  • Centralized Config Service: We introduced a centralized config-service with database-backed dynamic settings, enabling admins to modify branding, upload limits, authentication parameters, and system configurations through a web dashboard without service restarts or redeployment (#3453, #3458, #3466, #3480, #3511, #3522, #3533, #3622, #3664, #3693, #3700, #3743).
  • Dependency Updates: We upgraded PyArrow from 19.0.1 to 21.0.0, updated jOOQ to 3.16.23, and updated Formly to 6.3.12 (#3368, #3565, #3600, #3687).

5. UI and UX Improvements

  • Workflow Workspace Enhancements: The workspace now supports dynamic name-field resizing, double-click to open operator results, auto-centering, broken-workflow alerts, and reliable refresh behavior (#3342, #3359, #3573, #3603, #3606, #3636).
  • Result Panel Improvements: The result panel shows a compact statistics row, supports resizable panels, exports to HTML and Arrow, and auto-expands on click (#3429, #3435, #3568).
  • Visual Design Updates: We introduced a new Apache Texera logo, clarified button styling, added an adaptive navigation menu, and improved port statistics displays (#3481, #3483, #3486, #3489, #3569, #3585, #3601, #3722).
  • Dataset UI Improvements: The dataset UI shows version creation dates with timezone tooltips, displays upload speed and time estimates, updates progress in real time, and validates file sizes (#3426, #3427, #3612, #3662, #3702, #3706).
  • Admin Dashboard Enhancements: The admin dashboard includes user activity metrics, a better comment interface, column layout refinements, and a live branding preview (#3436, #3464, #3594, #3625, #3700).
  • Editor Experience: The editor shows operator descriptions in panels, supports dynamic sidebar customization, improves edge deletion and reconnection, and organizes the operator menu by category (#3349, #3364, #3411, #3437, #3479, #3533).

v1.0.0

04 Apr 15:06
59c2d0b

Choose a tag to compare

Since 2016, our team has been developing the Texera system (https://texera.io/) to support cloud-based data science, AI, and ML using GUI-based workflows. Today (April 7, 2025), we are glad to make an official release of its first major version, v1.0.0!

Major Features

  • Supporting low/no coding data science using workflows
  • Parallel data-processing engine running on computing clusters
  • Using the Apache Pekko actor-model system
  • Supporting UDFs in Python, R, and Java
  • Supporting ML training and inference
  • Including a rich collection of ML operators
  • Interactive workflow execution model that supports pausing and resuming
  • Supporting collaborations with shared editing, shared execution, and version control
  • Supporting debugging, including line-by-line debugging in Python UDFs
  • Supporting reproducibility of data analysis
  • Region-by-region execution with full pipelining in each region
  • Storing execution results using Apache Iceberg
  • Supporting version-controlled file collections on S3-compatible storage managed by LakeFS
  • Adopting a microservice-based architecture using Kubernetes and Docker
  • Supporting computing isolation and storage isolation of multiple tenants

For users and developers

  • For users who want to install Texera on a single machine using Docker, follow the instructions on this page.
  • For developers who want to contribute to Texera, follow the instructions on this page.

Texera first beta release 0.1.0

23 Sep 22:49

Choose a tag to compare

Pre-release

After the renaming process is finished, we publish the first release of Texera (version 0.1.0)