Top Banner
Product Summary Fast Manipulation of Big Data • Transformation • Conversion • Protection • Reporting
4

Fast Manipulation of Big Data - IBM

Oct 02, 2021

Download

Documents

dariahiddleston
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Fast Manipulation of Big Data - IBM

Product Summary

Fast Manipulation of Big Data

• Transformation• Conversion• Protection• Reporting

Page 2: Fast Manipulation of Big Data - IBM

CoSort: It’s About Time• Fast,ConsolidatedBigDataProcessing

CoSort combines speed with versatility. Only with CoSort, can you simultaneously transform, convert, protect, and report:

• AnyDataVolume,Source,andType,RelationalTableandFlat-FileFormatCoSort collates and converts >100 data types, including EBCDIC, zoned decimal, IP addresses, Unicode, COBOL, VarChar, multi-byte Asian characters, and timestamps. CoSort can convert files between CSV, ISAM, LDIF, Vision, XML, and several other sequential file formats, while also moving between big- and little-endian formats. Unstructured, semi-structured, and other custom data sources can be accomodated through custom input procedures, and other methods, at a fraction of their cost.

• FastDatabaseandDataWarehouseOperationsCoSort can select, join, order-by, and group-by faster than DB2, Oracle, Informatica, DataStage, by sorting, joining and segmenting billions of flat-file rows, CoSort captures changed data and speeds bulk loads while reducing database overhead and storage space.

• TargetedDataSecurityCoSort can de-identify, encrypt, filter, mask, pseudonymize, and randomize sensitive data at the field level. CoSort can secure sensitive fields in tables and files, create an audit trail, and leave remaining data alone.

• Simple,Centralized,OpenMetadataCoSort runs with familiar and explicit data definition and manipulation statements residing in reusable text repositories. Several applications create or leverage CoSort metadata, including IRI’s Fast Extract (FACT) and RowGen (test data) tools, the RapidACE data model consolidator, the Meta Integration Model Bridge (MIMB), and the following:

Legacy Sort and Metadata Migration Tools• JCL: Converts MVS and VSE sort parameters to CoSort (SortCL) job scripts.• MicroFocusCOBOL: Drop-in replacements for ACUCOBOL-GT and Micro Focus COBOL (Net Express, Server Express and

Workbench) runtime sort verbs. CoSort also collates and converts Vision, I-SAM, and MF Variable Length files. • COBOLCopybooks: Converts COBOL FDs to SortCL data definitions.

• OtherMetadata: CSV, CTL, ELF, LDIF and XML file parsers create SortCL data definitions.

Third-Party Sort Replacements• AmdocsEnsemble- via legacy sort replacement• IBMWebSphereDataStage- exclusive sort stage plug-in• InformaticaPowerCenter - exclusive Sorter Tx custom transform• ClerityMainframeRehostingSoftware - UniKix MBE/TPE sorts• UNIX- /bin/sort command

Page 3: Fast Manipulation of Big Data - IBM

CoSort Speeds ETL Tasks & Tools• ETLTools- ETI Solution, IBM DataStage, Informatica

PowerCenter, Microsoft SSIS, Oracle Data Integrator, Pentaho Sqoop, Talend

• BITools- BIRT, BOBJ, Cognos, Excel, MicroStrategy, QlikView, OBIEE

• AnalyticTools - ActuateOne, Paques, R, SAS, Splunk, Spotfire, Tableau

• Databases - Altibase, DB2, MySQL, Oracle, SQL Server, Sybase, Tibero

Supported Platforms• Linux on x86, Itanium, IBM s/p/i/z, FreeBSD• UNIX (AIX, HP-UX, Solaris, Tru64 & more)• Windows® (XP, 2000/2003/2008, Vista, 7, 8)

By pre-sorting with CoSort, bulk loads are faster, which makes it easier to maintain more tables in (fast) query order.

Data & File Sources• ASCII, EBCDIC, COBOL and C (binary) forms• European, ISO, Japanese & U.S. Timestamps• IP Addresses, Whole Numbers• Custom Input Procedures (e.g. UIMA)• Bulk RDBMS Unloads - via Fast Extract (FACT)• DB Tables and Excel Spreadsheets• ACUCOBOL-GT (Vision) Indexed Files• IBM Unblocked Variable Record Format• LDIF (LDAP), Microsoft CSV, Flat XML• Micro Focus Variable Length & I-SAM Files• Sequential Flat Files (Line, Record, Variable)• Fixed Block File Format• VSAM - via Clerity Mainframe Re-hosting• W3C Common & Extended Log (Web)• Unicode and Native Multi-Byte Character Sets

Compatible Products• Analytix DS Mapping Manager• BIRT - Eclipse Reporting Tool• FACT - Fast Extract for Oracle, DB2, et al.• FieldShield - Audited Data-Centric Security• iDashboards - Visual BI• MIMB - Meta Integration Model Bridge• NextForm - File and Data Type Conversion• RapidACE - 3D Data Model Integration• RowGen - Referentially Correct Test Data• Trillium - Data Quality

IRI Workbench - Integrated Development Environment Built on EclipseCoSort users on Windows get a free Eclipse plug-in to create, run, and manage their data manipulation and management activities. The GUI contains several job wizards and functional dialogs, plus a syntax-aware script editor, dynamic outlines for SortCL jobs and metadata, Sirius™ visual workflow infrastructure, and an optional spreadsheet-style code builder for source-to-target mappings.

CoSort Speeds VLDB Loads & Reorgs

Page 4: Fast Manipulation of Big Data - IBM

CoSort Business Benefits• ParalleltransformsincreasedataavailabilityfordatawarehouseETL,ELT,andODSstaging• Taskconsolidationspeedsruntimes,minimizesCPUcycles,anddelayshardwareupgrades• Fast,feature-rich4GLcanreplaceslower3GLprograms,shellscripts,andSQLprocedures• Bigdataintegrationandreformattingcapabilitiesrapidlyfranchisedataforbusinessintelligencetools• Intuitiveandopenjobscriptsandmetadatareducedevelopmenttrainingandmaintenancecost• FamiliarEclipseGUIfacilitatesjobdesign,cross-platformexecution,andprojectmanagement• Seamlesssortaccelerationplug-insoptimizeROIforpackagedapplicationsandETLtools• Built-indetail,delta(changeddatacapture),andsummaryreportingmarriesdecisionstotransformations• Built-infieldsecurityreducesriskoffines,branddamage,andlitigation• Flexible,perpetual-uselicensingmodelsloweracquisitionandownershipcosts

2194HighwayA1A,3rdFloorMelbourne,FL32937USA

1.321.777.88891.800.333.SORT WWW.IRI.COM

Copyright©2015,InnovativeRoutinesInternational(IRI),Inc.AllRightsReserved.CoSort,FieldShield,NextForm,RowGen,andVoracityareregisteredtrademarksofIRI,Inc.FACTisatrademarkofDataStreamsCorp.(a/k/aCoSortKorea).Otherproduct,brand,orcompanynamesare,ormaybe,(registered)trademarksoftheirrespectiveholders.

• AmericanAirlines-Sabre“Fortune Magazine says our yield maximization system is private industry’s most frequently updated database -- one of the world’s largest! We chose CoSort to quarter the time batch reorgs take our airline clients on their SMP systems. The CoSort team responds rapidly and fairly to our technical and business needs.”• BluePhoenix“We completed a complete migration of a large MVS/ADSO environment (CICS and BATCH) to a Sun Solaris/Oracle solution. The sorting requirements were very complex with a lot of totaling and speed requirements. We used CoSort and accomplished the mission successfully. The nature of the project required a lot of technical interaction and support from the CoSort team. They were very responsive, very helpful and very professional. In our experience the CoSort solution provides the best value for performance, support, and cost of ownership.”

• Comcast“The Comcast Data Engineering and Management Integration (DEMI) organization works with 10 terabytes of DAP (Directory Access Protocol) data on a daily basis as we work to distribute business critical information resources to the rest of the company. The fact is we would not be able to pull this off successfully without CoSort. It accurately and quickly processes billions of rows of DAP data and allows us to join and analyze this information in connection with our other data warehouse processes. No other tool gives us this much speed and flexibility and allows the processing of this volume of flat-file LDIF records. The very talented CoSort team worked directly with us to develop their module and was able to turn it around very quickly. In turn they have developed a long-term customer relationship with America’s largest Cable Operator and a large (50 TB) and growing data warehouse.”

Typical CoSort Users