Connecting to HiveServer2 in a Hadoop 2.x HA Cluster using DBVisualizer
In a previous post, the steps for downloading and configuring DBVisualizer to connect to Hive were presented. The connection was made using a Hive Host Name in a Hadoop cluster with a single Namenode....
Configuring Talend Hive Components: HiveServer1 vs. HiveServer2, Embedded vs....
Talend Hive components have a number of options that can be confusing when making connections to a Hadoop cluster. Options include selecting between HiveServer1 and...
Introducing Talend 6.0
Talend 6 was released in September 2015 and with it come a number of new and important features and updates, including product name changes. One of the most significant features introduced in Talend...
Dynamic Select SQL statement execution for moving data from DB2 to Hadoop...
Data warehousing and ETL processes usually repeat common patterns across different data domains (databases, tables, subject areas, etc.). One such pattern is copying data from a transactional system...
'Not implemented by the DistributedFileSystem FileSystem implementation'...
The 'Not implemented by the DistributedFileSystem FileSystem implementation' error occasionally rears its head when debugging Talend Big Data jobs. This is a cryptic message that actually intends to...
Enabling Hive High Availability in Talend Studio
Starting with Talend 5.6.1, Talend released a patch that updates the Hive components so they can connect to Hadoop clusters configured for High Availability (HA). In HA, instead of configuring the...
Connecting Talend to an MS SQL Server DB with 'applicationIntent=ReadOnly'
Out of the box, Talend uses the open source jTDS driver to connect to MS SQL Server databases. This driver, however, does not support connecting to an AlwaysOn-enabled database. A generic JDBC driver...
Best practice for creating Talend metadata connections to NoSQL DBs and Hadoop
Starting with Talend 5.6.2, it is now possible to create metadata connections for NoSQL databases and Hadoop platforms using the metadata feature in the Studio. Even better, the Studio now allows...
Testing Hive or MySQL database connections directly using JDBC driver
Every once in a while, one runs into a situation where one cannot connect to a database from a tool or application. When this happens, the best way to isolate the issue is to try to connect to the same...
Talend connection component retry logic & design for databases, web services...
In cases where, for whatever reason, a Talend job does not always connect to a backend service layer (database, web service, FTP, Salesforce, Dropbox) on the first try, the job can be...