Insert update strategy when using Talend for aws redshift database

Saturday, 18 July 2015

Insert update strategy when using Talend for aws redshift database

Insert update strategy when using Talend for aws redshift database

In this blog i want to highlight insert and update strategy which i found working good with aws redshift database. There would be other strategies which would also work good, but i would share my findings - when i used Talend with redshift.

As of tos v5.6.1 - there is no bulk output component for redshift. so if you use output component for redshift it is slow when it comes to insert, update. What redshift works best is with using their data load utility of COPY from flat file in S3 or it works best when you want to insert data using custom SQL "insert into targettable select from sourcetable";

similarly for update it works best if you use update from table as below

update targettable

set targettable.column = sourcetable.column

from sourcetable

where tarettable.columnkey = sourcetable.columnkey.

So in my work with talend and redshift I mostly endup in using tRedshiftRow component where you can write your custom SQL for insert, update or copy command and this is processed by redshift very fast. There is magnitude of difference in performance when you use custom SQLfor insert, update or copy command.

So when working on redshift DB via talend, try to design your jobs in the way that for jobs which process bulk data, you generate a flat file from source, push it to S3 and use tRedshiftRow to write copy command to load it. If you want to transform data across your tables in your database then write your own insert, update statement for better performance.

2 comments:

likitha5 October 2018 at 22:51
Nice work, your blog is concept oriented ,kindly share more blogs like this
AWS Online Course
ReplyDelete
Replies
Ayush30 March 2025 at 07:03
Just as using tJavaRow for row-by-row transformations can streamline your data processes, gaining expertise in medical coding can streamline your career path in the healthcare industry. If you're in Delhi, consider enrolling in a medical coding course to develop valuable skills and unlock new opportunities!
Medical Coding Courses in Delhi
ReplyDelete
Replies

Add comment

Talend - Data Integration using talend

Saturday, 18 July 2015

Insert update strategy when using Talend for aws redshift database

2 comments:

Total Pageviews