Releases: apache/texera
Releases · apache/texera
Apache Texera(Incubating) Release v1.1.0 RC4
What's Changed
- Fix unaligned representation of python objects in BINARY format by @shengquan-ni in #3347
- Add the computing unit managing service and k8s architecture by @bobbai00 in #3337
- Fix streaming issue for single CSV export by @aicam in #3350
- Move export result resource to computing unit by @aicam in #3351
- Update the About page by @bobbai00 in #3353
- Cap the number of displayed console messages at GUI and the max length of each message by @kunwp1 in #3346
- Admins can make comments on users by @paulschatt in #3266
- Add Dendrogram Visualization Operator by @colinthebomb1 in #3352
- Avoid overlaps between name editing button and information button in operator property panel by @Ma77Ball in #3349
- Enhance Helm Charts for Easier GCP Deployment by @shengquan-ni in #3358
- Fix heap error for exporting zip result by @aicam in #3360
- Improve Helm chart and jwt secret access logic by @bobbai00 in #3357
- Add ViolinPlot Option to BoxPlot Operator by @qzt168 in #3356
- Improve Computing Unit Frontend by @bobbai00 in #3363
- Fix File Scan Operator Extract Function by @aglinxinyuan in #3367
- Fix hard-coded values and wrong configurations in Helm by @bobbai00 in #3366
- Fix the descriptions, variable names, and functions in the BoxViolinPlot operator by @qzt168 in #3365
- Add GPU support for creating the computing unit by @bobbai00 in #3371
- Enhance Visualization Grouping with New Category Hierarchy by @xuang7 in #3369
- Enable dynamic resizing for the workflow name input field in the workspace by @qzt168 in #3359
- Add slider to adjust JVM memory size when creating the computing unit by @bobbai00 in #3372
- Ensure Operator Menu Displays All First-Level Categories by @colinthebomb1 in #3364
- Fix metric-resolution for the metric server specified in the helm chart by @aicam in #3376
- Fix export in cluster mode by @aicam in #3378
- Add Bullet Chart Visualization Operator by @xuang7 in #3362
- Disabling Envoy HTTP Timeout by @kunwp1 in #3380
- Organize request and response structures by @aicam in #3379
- Fix formatting issue of envoy config yaml by @shengquan-ni in #3382
- Fix the issue where the dataset content cannot be displayed on the dataset detail page when the user is not logged in by @GspikeHalo in #3373
- Add size info for workflow json by @paulschatt in #3256
- Add new R library to Dockerfile and fix the jobs' order in Helm chart by @bobbai00 in #3386
- Add admin account in the configuration file and make local-sign-up accounts
restrictedby @bobbai00 in #3387 - Bump pyArrow to 19.0.1 by @aglinxinyuan in #3368
- Create NOTICE file by @aglinxinyuan in #3388
- Fix the issue where the File Scan operator fails to properly handle compressed files containing multiple files by @GspikeHalo in #3385
- Enable email notification to the administrator upon an unauthorized or inactive user login attempt by @GspikeHalo in #3391
- Ensure that after the workspace is opened the operator is displayed in the center by @GspikeHalo in #3344
- Set
image-pull-policywhen creating the computing unit by @bobbai00 in #3395 - Use
MultipartUploadto implement the upload-one-file endpoint of the FileService by @aicam in #3396 - Enable email notifications to users upon an inactive login attempt by @GspikeHalo in #3393
- Add if statement on catch error for
uploadOneFileToDatasetby @aicam in #3407 - Add a flag at the workflow workspace to prevent registering the auto persistence when the workflow is loading by @bobbai00 in #3412
- Fix Incorrect Version and Execution Order in Backend Queries by @kunwp1 in #3410
- Fix base image to node:18.17 in the
texera-web-application.dockerfileby @bobbai00 in #3413 - Add ASF Header to JOOQ generated files by @aglinxinyuan in #3414
- Remove the job waiter in the helm chart and make
FileServicedo forceful healthcheck by @bobbai00 in #3416 - Add ASF header and RAT CI by @aglinxinyuan in #3415
- Replace Terminate Button Icon with Close-Circle by @xuang7 in #3424
- Add RangeSlider visualization operator by @Ma77Ball in #3374
- Reimplement User Quota Service to remove MongoDB Dependency by @yunyad in #3402
- Add Nested Table Visualization Operator by @colinthebomb1 in #3397
- Sanitize Uppercase Characters to Lowercase in Dataset Names by @xuang7 in #3423
- Add Pre-Check File Size and Type Before File Preview by @xuang7 in #3427
- Revert Changes to Binary Chunking Due to Side Effects by @kunwp1 in #3431
- Display Individual Progress Indicators for Concurrent File Uploads by @xuang7 in #3426
- Add DaemonSet for ComputingUnitMaster image pre-pulling by @bobbai00 in #3419
- Install necessary packages for building gui in texera web application dockerfile by @bobbai00 in #3433
- Fix the compilation of Python Lambda Operator by @bobbai00 in #3438
- Fix ChannelMarker transmission between Java and Python by @aglinxinyuan in #3434
- Fix ChannelMarker sending on PyAmber by @aglinxinyuan in #3439
- Implement auto-expansion of the result panel upon operator click by @GspikeHalo in #3435
- Enhance user experience of comments box by @GspikeHalo in #3436
- Add Volcano Plot Visualization Operator by @SarahAsad23 in #3418
- Add
share-editing-serverandpylspdockerfiles to the deployment folder by @bobbai00 in #3441 - Fix ChannelMarker Partitioning logic on PyAmber by @aglinxinyuan in #3445
- Add HTML format & large Arrow files to result exportation by @aicam in #3429
- Make operator property sidebar dynamically adjust height based on content by @GspikeHalo in #3437
- Fix Incorrect Channel ID Inclusion Across Workers in ChannelMarker by @aglinxinyuan in #3448
- Handle mismatched datasets on displaying dashboard by @xuang7 in #3446
- Use Input Port Materialization Reader Threads to Replace Cache Source Operators by @Xiao-zhen-Liu in #3425
- Persist the spec of the
computing_unitentity and its relationship withworkflow_executionsinto DB by @bobbai00 in #3400 - Add HILDA 2025 workshop paper to Readme by @shengquan-ni in #3450
- Fix issue with workspace defaulting to previous workflow after refresh by @GspikeHalo in #3342
- Add a new config to control the maximum websocket payload size by @bobbai00 in #3452
- Enable adjusting shared memory for the computing unit by @bobbai00 in #3451
- Fix operators showing "Running" state after workflow completion by @shengquan-ni in #3463
- Add domain display in email content by @GspikeHalo in #3440
- Fix project startup failure on macOS caused by attempting to retrieve the domain name by @GspikeHalo in #3465
- Move gui and user-system configurations to
core/configand host config-retrieval endpoints viacore/config-serviceby @bobbai00 in #3458 - Move all conf files and readers to
core/configby @bobbai00 in #3466 - Cleanly Enforce Input Port Dependencies by @Xiao-zhen-Liu in #3460
- Migrate existing Markers to Control Messages by @aglinxinyuan in #3432
- Remove Marker Class by @agl...
Apache Texera(Incubating) Release v1.1.0 RC 3
What's Changed
- Fix unaligned representation of python objects in BINARY format by @shengquan-ni in #3347
- Add the computing unit managing service and k8s architecture by @bobbai00 in #3337
- Fix streaming issue for single CSV export by @aicam in #3350
- Move export result resource to computing unit by @aicam in #3351
- Update the About page by @bobbai00 in #3353
- Cap the number of displayed console messages at GUI and the max length of each message by @kunwp1 in #3346
- Admins can make comments on users by @paulschatt in #3266
- Add Dendrogram Visualization Operator by @colinthebomb1 in #3352
- Avoid overlaps between name editing button and information button in operator property panel by @Ma77Ball in #3349
- Enhance Helm Charts for Easier GCP Deployment by @shengquan-ni in #3358
- Fix heap error for exporting zip result by @aicam in #3360
- Improve Helm chart and jwt secret access logic by @bobbai00 in #3357
- Add ViolinPlot Option to BoxPlot Operator by @qzt168 in #3356
- Improve Computing Unit Frontend by @bobbai00 in #3363
- Fix File Scan Operator Extract Function by @aglinxinyuan in #3367
- Fix hard-coded values and wrong configurations in Helm by @bobbai00 in #3366
- Fix the descriptions, variable names, and functions in the BoxViolinPlot operator by @qzt168 in #3365
- Add GPU support for creating the computing unit by @bobbai00 in #3371
- Enhance Visualization Grouping with New Category Hierarchy by @xuang7 in #3369
- Enable dynamic resizing for the workflow name input field in the workspace by @qzt168 in #3359
- Add slider to adjust JVM memory size when creating the computing unit by @bobbai00 in #3372
- Ensure Operator Menu Displays All First-Level Categories by @colinthebomb1 in #3364
- Fix metric-resolution for the metric server specified in the helm chart by @aicam in #3376
- Fix export in cluster mode by @aicam in #3378
- Add Bullet Chart Visualization Operator by @xuang7 in #3362
- Disabling Envoy HTTP Timeout by @kunwp1 in #3380
- Organize request and response structures by @aicam in #3379
- Fix formatting issue of envoy config yaml by @shengquan-ni in #3382
- Fix the issue where the dataset content cannot be displayed on the dataset detail page when the user is not logged in by @GspikeHalo in #3373
- Add size info for workflow json by @paulschatt in #3256
- Add new R library to Dockerfile and fix the jobs' order in Helm chart by @bobbai00 in #3386
- Add admin account in the configuration file and make local-sign-up accounts
restrictedby @bobbai00 in #3387 - Bump pyArrow to 19.0.1 by @aglinxinyuan in #3368
- Create NOTICE file by @aglinxinyuan in #3388
- Fix the issue where the File Scan operator fails to properly handle compressed files containing multiple files by @GspikeHalo in #3385
- Enable email notification to the administrator upon an unauthorized or inactive user login attempt by @GspikeHalo in #3391
- Ensure that after the workspace is opened the operator is displayed in the center by @GspikeHalo in #3344
- Set
image-pull-policywhen creating the computing unit by @bobbai00 in #3395 - Use
MultipartUploadto implement the upload-one-file endpoint of the FileService by @aicam in #3396 - Enable email notifications to users upon an inactive login attempt by @GspikeHalo in #3393
- Add if statement on catch error for
uploadOneFileToDatasetby @aicam in #3407 - Add a flag at the workflow workspace to prevent registering the auto persistence when the workflow is loading by @bobbai00 in #3412
- Fix Incorrect Version and Execution Order in Backend Queries by @kunwp1 in #3410
- Fix base image to node:18.17 in the
texera-web-application.dockerfileby @bobbai00 in #3413 - Add ASF Header to JOOQ generated files by @aglinxinyuan in #3414
- Remove the job waiter in the helm chart and make
FileServicedo forceful healthcheck by @bobbai00 in #3416 - Add ASF header and RAT CI by @aglinxinyuan in #3415
- Replace Terminate Button Icon with Close-Circle by @xuang7 in #3424
- Add RangeSlider visualization operator by @Ma77Ball in #3374
- Reimplement User Quota Service to remove MongoDB Dependency by @yunyad in #3402
- Add Nested Table Visualization Operator by @colinthebomb1 in #3397
- Sanitize Uppercase Characters to Lowercase in Dataset Names by @xuang7 in #3423
- Add Pre-Check File Size and Type Before File Preview by @xuang7 in #3427
- Revert Changes to Binary Chunking Due to Side Effects by @kunwp1 in #3431
- Display Individual Progress Indicators for Concurrent File Uploads by @xuang7 in #3426
- Add DaemonSet for ComputingUnitMaster image pre-pulling by @bobbai00 in #3419
- Install necessary packages for building gui in texera web application dockerfile by @bobbai00 in #3433
- Fix the compilation of Python Lambda Operator by @bobbai00 in #3438
- Fix ChannelMarker transmission between Java and Python by @aglinxinyuan in #3434
- Fix ChannelMarker sending on PyAmber by @aglinxinyuan in #3439
- Implement auto-expansion of the result panel upon operator click by @GspikeHalo in #3435
- Enhance user experience of comments box by @GspikeHalo in #3436
- Add Volcano Plot Visualization Operator by @SarahAsad23 in #3418
- Add
share-editing-serverandpylspdockerfiles to the deployment folder by @bobbai00 in #3441 - Fix ChannelMarker Partitioning logic on PyAmber by @aglinxinyuan in #3445
- Add HTML format & large Arrow files to result exportation by @aicam in #3429
- Make operator property sidebar dynamically adjust height based on content by @GspikeHalo in #3437
- Fix Incorrect Channel ID Inclusion Across Workers in ChannelMarker by @aglinxinyuan in #3448
- Handle mismatched datasets on displaying dashboard by @xuang7 in #3446
- Use Input Port Materialization Reader Threads to Replace Cache Source Operators by @Xiao-zhen-Liu in #3425
- Persist the spec of the
computing_unitentity and its relationship withworkflow_executionsinto DB by @bobbai00 in #3400 - Add HILDA 2025 workshop paper to Readme by @shengquan-ni in #3450
- Fix issue with workspace defaulting to previous workflow after refresh by @GspikeHalo in #3342
- Add a new config to control the maximum websocket payload size by @bobbai00 in #3452
- Enable adjusting shared memory for the computing unit by @bobbai00 in #3451
- Fix operators showing "Running" state after workflow completion by @shengquan-ni in #3463
- Add domain display in email content by @GspikeHalo in #3440
- Fix project startup failure on macOS caused by attempting to retrieve the domain name by @GspikeHalo in #3465
- Move gui and user-system configurations to
core/configand host config-retrieval endpoints viacore/config-serviceby @bobbai00 in #3458 - Move all conf files and readers to
core/configby @bobbai00 in #3466 - Cleanly Enforce Input Port Dependencies by @Xiao-zhen-Liu in #3460
- Migrate existing Markers to Control Messages by @aglinxinyuan in #3432
- Remove Marker Class by @agl...
Apache Texera(Incubating) Release v1.1.0 RC
What's Changed
- feat(auth): removed token refresh functionality by @Sentiaus in #3765
- feat(access-control-service): AccessControlService added as a new micro service by @aicam in #3767
- fix: Persist & apply “Data Transfer Batch Size” from Workflow Settings by @mengw15 in #3770
- fix(op): fix incorrect names for Machine Learning Operators by @aglinxinyuan in #3776
- feat(op): add ML training operators for linear and logistic regression by @yunyad in #3779
- feat: cap number of concurrent file uploads by @xuang7 in #3735
- fix: Correctly format workflow JSON when downloading from hub by @colinthebomb1 in #3751
- fix(ui): Prevent link delete buttons from overlapping by @Ma77Ball in #3602
- chore: Add publish section with whoami to .asf.yaml by @pjfanning in #3785
- fix(gui): reopen workflow websocket when switching workflows by @seongjinyoon in #3773
- fix(ui): move quota diagrams to new tab by @gracecluvohio in #3781
- fix(ci): add explicit jdbc driver loading in relevant e2e tests by @Xiao-zhen-Liu in #3783
- fix: pin R arrow package to version 14.0.2.1 in computing-unit-master by @bobbai00 in #3761
- chore(deps): bump torch from 2.4.1 to 2.8.0 and typing_extensions from 4.9.0 to 4.10.0 by @mengw15 in #3800
- chore(deps): bump scikit-learn from 1.4.0 to 1.5.0 in /core/amber by @dependabot[bot] in #3799
- chore: refactor user avatar component by @aglinxinyuan in #3790
- chore(deps): bump axios from 1.7.7 to 1.12.2 in /core/gui by @dependabot[bot] in #3797
- chore(deps): bump form-data from 3.0.2 to 3.0.4 in /core/gui by @dependabot[bot] in #3793
- fix(debugger): do not truncate debug messages by @Yicong-Huang in #3786
- feat(config): allow default.conf to be set using environment variables by @bobbai00 in #3757
- feat: display datasets version creation timestamp by @xuang7 in #3803
- chore: Upgrade checkout and setup actions to latest versions by @pjfanning in #3808
- feat: add save button for sidebar tabs on the admin settings page by @xuang7 in #3801
- feat(dataset): allow datasets to have the same name across different users by @AnzhiZhang in #3715
- ci: automatically label PRs by @Yicong-Huang in #3536
- ci: fix labeler.yml glob pattern syntax by @Yicong-Huang in #3813
- ci: fix labeler configuration for file glob patterns by @Yicong-Huang in #3814
- feat(gui): display the owner's avatar next to each CU by @yunyad in #3784
- fix(ci): explicitly load jdbc driver when accessing iceberg postgres catalog by @Xiao-zhen-Liu in #3817
- fix(ci): narrow backend file patterns in labeler.yml by @Yicong-Huang in #3815
- fix: delete button only clickable from outer ring by @Ma77Ball in #3809
- chore: add bug report template by @Yicong-Huang in #3812
- chore(deps): bump @babel/helpers from 7.25.7 to 7.28.4 in /core/gui by @dependabot[bot] in #3795
- fix: conditionally show upload panels based on upload activity by @xuang7 in #3818
- fix(image): exec format error in pylsp Docker container by @bobbai00 in #3811
- fix(gui): keep words complete in the workspace' comment boxes by @colinthebomb1 in #3597
- chore: Improve Bug Issue Template by @kunwp1 in #3826
- chore(deps): bump transformers from 4.44.2 to 4.53.0 in /core/amber by @dependabot[bot] in #3635
- fix(deployment): deduplicate SQL files under
deploymentby using symlinks by @bobbai00 in #3824 - chore(ci): automatically assign an author of a PR by @Yicong-Huang in #3835
- feat(amber): enable user system by default by @Xiao-zhen-Liu in #3782
- chore(deps): bump prismjs from 1.29.0 to 1.30.0 in /core/gui by @dependabot[bot] in #3796
- fix(access-control-service): fix formatting of
AccessControlResourceby @aicam in #3838 - chore(ci): revert automatically assign an author of a PR by @Yicong-Huang in #3839
- fix(access-control-service): fix service file formats by @aicam in #3840
- feat(access-control-service): add
access-control-serviceto authorize the requests to/wsapiandComputing Unitendpoints by @aicam in #3598 - fix(file-service): avoid manually constructing the Dataset object through parameters by @bobbai00 in #3820
- refactor(config): remove user-sys enabled flag by @Xiao-zhen-Liu in #3831
- fix(gui): enable frontend undo-redo with bug-fix for shared editing by @Xiao-zhen-Liu in #3836
- feat: enable canceling pending uploads before upload begins by @xuang7 in #3847
- feat: improve
file-serviceDocker Compose file by @AnzhiZhang in #3834 - feat: prevent export of results from non-downloadable datasets by @seongjinyoon in #3772
- feat: switch workflow result downloads to use browser native downloads by @madisonmlin in #3728
- fix(ui): remove duplicate left sidebar border and extend to bottom of page by @gracecluvohio in #3849
- chore: rename packages to org.apache by @aglinxinyuan in #3848
- chore: relocate core/gui to frontend by @Yicong-Huang in #3851
- chore: relocate root project build out from core by @Yicong-Huang in #3852
- chore: relocate access-control-service by @Yicong-Huang in #3866
- chore: relocate computing-unit-managing-service by @Yicong-Huang in #3867
- fix: Clear Python executable path in udf.conf by @aglinxinyuan in #3869
- fix: use
TEXEAR_HOMEor the current directory as the default working directory by @Yicong-Huang in #3870 - fix: use frontend as the logo path by @Yicong-Huang in #3876
- chore: relocate workflow-compiling-service by @Yicong-Huang in #3868
- chore: relocate file-service by @Yicong-Huang in #3873
- chore: relocate config-service by @Yicong-Huang in #3874
- chore: relocate
core/pyright-language-serverand rename it topyright-language-serviceby @Yicong-Huang in #3875 - chore: rename
core/scriptstobinby @Yicong-Huang in #3877 - chore: relocate amber by @Yicong-Huang in #3878
- chore: remove
core/logby @Yicong-Huang in #3881 - chore: move
deploymenttobin/deploymentby @Yicong-Huang in #3883 - chore: rename
coretocommonby @Yicong-Huang in #3882 - chore: move
bin/sqltosqlby @Yicong-Huang in #3884 - fix: use
TEXERA_HOMEfor jooq by @Yicong-Huang in #3886 - fix: move
.scalafix.confout to root by @Yicong-Huang in #3888 - fix: use
logs/for all services by @Yicong-Huang in #3887 - fix: remove a duplicated python-language-service.sh by @Yicong-Huang in #3889
- chore(deps): bump brace-expansion from 1.1.11 to 1.1.12 in /core/gui by @dependabot[bot] in #3798
- refactor(frontend): Simplify onClickToggleGrids logic by @aglinxinyuan in #3896
- fix: restructure bin/deployment and update Dockerfiles for new repository structure by @bobbai00 in #3892
- feat(frontend): introduce dropdown menu for layers by @aglinxinyuan in #3898
- feat(bin): add some helper scripts by @Yicong-Huang in #3890
- fix(deployment): add image tag for MinIO image in helm chart by @aicam in #3903
- chore(doc): create pull request template for contributors by @Yicong-Huang in #3899
- feat(op): add Substring Sear...
Apache Texera(Incubating) Release v1.1.0
Release Highlights
1. Feature Upgrades
- Advanced Computing Unit Feature: We refined the Kubernetes-based computing unit layer by improving the UI/UX and enabling shared-memory control, GPU support and JVM memory tuning (#3337, #3363, #3371, #3372, #3400, #3451, #3591, #3604).
- Dynamic Admin Configuration System: Site settings are now stored in the database, and admins can customize logos and favicons with a live preview of branding changes (#3453, #3480, #3700, #3743).
2. Operators
- New Visualization Operators: We added Choropleth Map, Bullet Chart, Dendrogram, Violin Plot, Volcano Plot, Nested Table, Range Slider, and Time Series Plot operators (#3352, #3356, #3362, #3374, #3397, #3418, #3531, #3744).
- Machine Learning Operators: We introduced a suite of scikit-learn training operators (#3535).
- Utility Operators: We added a Sleep operator, enhanced the Generator with examples, and improved the Projection operator with column re-ordering (#3537, #3543, #3661).
- Enhanced Visualization System: We organized operators with a new category hierarchy, improved grouping, and first-level category display (#3364, #3369).
3. Authentication & Security
- Admin Account Management: Admin accounts can be auto-registered from configuration, and admins can leave comments to users (#3266, #3387, #3625).
- Security Enhancements: The system tracks user activity with last-login times, separates JWT expiration from activity windows, and sends email alerts for unauthorized access attempts (#3391, #3393, #3541, #3693, #3694, #3703, #3731).
- Access Control Improvements: We refined JWT secret access logic, optimized token refresh behavior, and added login confirmation requirements (#3357, #3703).
4. Infrastructure & Deployment
- Apache Foundation Compliance: We added ASF headers, NOTICE and DISCLAIMER files, integrated Apache Rat checks in CI, and updated license headers (#3388, #3414, #3415, #3631, #3632, #3637).
- Kubernetes and Helm Enhancements: We added GCP deployment support, a DaemonSet for image pre-pulling, ingress class configuration, and improved job ordering (#3358, #3366, #3376, #3386, #3419, #3652).
- Build System Improvements: We automated jOOQ code generation in SBT, removed jOOQ from CI, and migrated builds to a JDK to support Java UDFs (#3628, #3670, #3689, #3746, #3747).
- Configuration Management: We centralized configuration in a dedicated service, added environment-variable defaults, and stored site settings in the database (#3458, #3466, #3622).
- Centralized Config Service: We introduced a centralized config-service with database-backed dynamic settings, enabling admins to modify branding, upload limits, authentication parameters, and system configurations through a web dashboard without service restarts or redeployment (#3453, #3458, #3466, #3480, #3511, #3522, #3533, #3622, #3664, #3693, #3700, #3743).
- Dependency Updates: We upgraded PyArrow from 19.0.1 to 21.0.0, updated jOOQ to 3.16.23, and updated Formly to 6.3.12 (#3368, #3565, #3600, #3687).
5. UI and UX Improvements
- Workflow Workspace Enhancements: The workspace now supports dynamic name-field resizing, double-click to open operator results, auto-centering, broken-workflow alerts, and reliable refresh behavior (#3342, #3359, #3573, #3603, #3606, #3636).
- Result Panel Improvements: The result panel shows a compact statistics row, supports resizable panels, exports to HTML and Arrow, and auto-expands on click (#3429, #3435, #3568).
- Visual Design Updates: We introduced a new Apache Texera logo, clarified button styling, added an adaptive navigation menu, and improved port statistics displays (#3481, #3483, #3486, #3489, #3569, #3585, #3601, #3722).
- Dataset UI Improvements: The dataset UI shows version creation dates with timezone tooltips, displays upload speed and time estimates, updates progress in real time, and validates file sizes (#3426, #3427, #3612, #3662, #3702, #3706).
- Admin Dashboard Enhancements: The admin dashboard includes user activity metrics, a better comment interface, column layout refinements, and a live branding preview (#3436, #3464, #3594, #3625, #3700).
- Editor Experience: The editor shows operator descriptions in panels, supports dynamic sidebar customization, improves edge deletion and reconnection, and organizes the operator menu by category (#3349, #3364, #3411, #3437, #3479, #3533).
v1.0.0
Since 2016, our team has been developing the Texera system (https://texera.io/) to support cloud-based data science, AI, and ML using GUI-based workflows. Today (April 7, 2025), we are glad to make an official release of its first major version, v1.0.0!
Major Features
- Supporting low/no coding data science using workflows
- Parallel data-processing engine running on computing clusters
- Using the Apache Pekko actor-model system
- Supporting UDFs in Python, R, and Java
- Supporting ML training and inference
- Including a rich collection of ML operators
- Interactive workflow execution model that supports pausing and resuming
- Supporting collaborations with shared editing, shared execution, and version control
- Supporting debugging, including line-by-line debugging in Python UDFs
- Supporting reproducibility of data analysis
- Region-by-region execution with full pipelining in each region
- Storing execution results using Apache Iceberg
- Supporting version-controlled file collections on S3-compatible storage managed by LakeFS
- Adopting a microservice-based architecture using Kubernetes and Docker
- Supporting computing isolation and storage isolation of multiple tenants
For users and developers
Texera first beta release 0.1.0
After the renaming process is finished, we publish the first release of Texera (version 0.1.0)