Cloud Data Fusion version 6.10.1 is generally available (GA)
Share
Services
## Feature
Cloud Data Fusion version 6.10.1 is generally available ([GA](https://cloud.google.com/products#product-launch-stages)). This release is in parallel with the CDAP 6.10.1 release.
## Feature
Creating a private instance with [Private Service Connect](https://cloud.google.com/data-fusion/docs/how-to/configure-private-service-connect) is GA in Cloud Data Fusion version 6.10.1.
## Feature
Per Namespace Service Accounts are GA in Cloud Data Fusion version 6.10.1\. For more information, see [Access control with namespace service accounts](https://cloud.google.com/data-fusion/docs/how-to/control-access-in-namespace).
## Feature
Syncing multiple pipelines from a namespace is GA in Cloud Data Fusion version 6.10.1, For more information, see [Sync Cloud Data Fusion pipelines with a remote repository](https://cloud.google.com/data-fusion/docs/how-to/source-control-management#sync-pipelines).
## Change
Changed in Cloud Data Fusion 6.10.1:
* [Source Control Management](https://cloud.google.com/data-fusion/docs/how-to/source-control-management) supports Bitbucket and Gitlab.
* Cloud Data Fusion uses the subnet used by the shared VPC network attachment in the default compute profile.
* Added support for option string field (keep-strings) in `parse-xml-to-json` Wrangler directive ([CDAP-20934](https://cdap.atlassian.net/browse/CDAP-20934)).
* The BigQuery sink plugin doesn't provide the **Dedupe By** option while in insert mode ([PLUGIN-900](https://cdap.atlassian.net/browse/PLUGIN-900)).
* The BigQuery plugin supports the JSON type ([PLUGIN-1563](https://cdap.atlassian.net/browse/PLUGIN-1563)).
* Improved error messages in the Spanner source ([PLUGIN-1748](https://cdap.atlassian.net/browse/PLUGIN-1748)).
* Improved retries in PubSub plugin ([PLUGIN-1769](https://cdap.atlassian.net/browse/PLUGIN-1769)).
## Fix
Fixed in Cloud Data Fusion 6.10.1:
* Fixed an issue causing runtime arguments of pipeline triggers to not propagate to downstream pipelines ([CDAP-20947](https://cdap.atlassian.net/browse/CDAP-20947)).
* Fixed an issue in Wrangler causing the `send-to-error-and-continue` directive to not initialize `dq_failure` when the condition is false ([PLUGIN-1736](https://cdap.atlassian.net/browse/PLUGIN-1736)).
* Fixed an issue that occurs if running a replication pipeline when task workers are enabled ([CDAP-20951](https://cdap.atlassian.net/browse/CDAP-20951)).
* Improved error reporting in the BigQuery Sink. Fixed an issue in BigQuery Argument Setter where validation error wasn't displayed correctly ([PLUGIN-788](https://cdap.atlassian.net/browse/PLUGIN-788), [PLUGIN-781](https://cdap.atlassian.net/browse/PLUGIN-781),
* [PLUGIN-1318](https://cdap.atlassian.net/browse/PLUGIN-1318)).
* Improved retries in BigQuery plugin ([PLUGIN-1715](https://cdap.atlassian.net/browse/PLUGIN-1715)).
* Fixed an issue with the Python plugin, where running in native mode doesn't work as intended ([PLUGIN-1617](https://cdap.atlassian.net/browse/PLUGIN-1617)).
* Fixed an issue causing certain connection parameters to not propagate in a MySQL connection ([PLUGIN-1728](https://cdap.atlassian.net/browse/PLUGIN-1728)).
* Fixed an issue causing the Cloud Storage Copy action to timeout while working with large files ([PLUGIN-1735](https://cdap.atlassian.net/browse/PLUGIN-1735)).
* Fixed an issue causing Copy and Move plugins to not create buckets at the destination path as expected, resulting in a runtime error ([PLUGIN-1738](https://cdap.atlassian.net/browse/PLUGIN-1738)).
* Fixed an issue causing empty source input to fail in multiple plugins ([PLUGIN-1742](https://cdap.atlassian.net/browse/PLUGIN-1742)).
* Fixed an issue with remote execution of Wrangler directives causing type information to not be emitted ([PLUGIN-1778](https://cdap.atlassian.net/browse/PLUGIN-1778)).
* Fixed an issue causing a `No record field provided`error ([CDAP-21024](https://cdap.atlassian.net/browse/CDAP-21024)).
* Streaming pipelines in Cloud Data Fusion support the Excel source. Batch pipelines with an Excel source can consume high memory and fail in large pipelines ([PLUGIN-1771](https://cdap.atlassian.net/browse/PLUGIN-1771)).
* Fixed an issue with using the Conditional plugin as a source for Wrangler, causing CDAP not to fetch the necessary schema ([CDAP-20890](https://cdap.atlassian.net/browse/CDAP-20890)).
* Fixed an issue with instance upgrades causing existing schedule names to be improperly encoded in the URL, resulting in pre-upgrade failure ([CDAP-20999](https://cdap.atlassian.net/browse/CDAP-20999)).
* Fixed an issue with schedules causing the maximum concurrent run property to not work as intended ([CDAP-20988](https://cdap.atlassian.net/browse/CDAP-20988)).
* Fixed an issue causing committed ID to incorrectly propagate when pushing pipeline configurations to Git ([CDAP-20932](https://cdap.atlassian.net/browse/CDAP-20932)).
## Breaking
Cloud Data Fusion version 6.10.1 has a known issue in the [Cloud Storage plugin](https://cloud.google.com/data-fusion/docs/how-to/configure-cloud-storage-batch-source) causing pipelines to intermittently fail if the plugin contains a \* regex pattern and uses Dataproc 2.0\. To mitigate this issue:
* [Change the Dataproc image to version 2.1](https://cloud.google.com/data-fusion/docs/how-to/change-dataproc-image#change-to-dataproc-21).
* Use an older plugin version.
* [Increase memory for the executor](https://cloud.google.com/data-fusion/docs/how-to/change-dataproc-image#memory-usage).