延伸資訊
解決方案
商業智慧 BI
建置建議與相關服務請參考
BI / ETL 服務

PDI Plugins

Pentaho Data Integration 預設提供大量的Plugins, 可讓開發人員設計Job、Transformation的Steps時使用。
Pentaho Plugins是開放架構,開發人員也可以從PDI Marketplace安裝擴充的Plugins,或直接至自Pentaho 可用Plugins下載, 或是自行開發Plugins。
PDI Plugins:
Input
  • CSV File Input
  • Data Grid
  • De-serialize From file
  • Email Messages Input
  • ESRI Shapefile Reader
  • Generate random credit card numbers
  • Generate Rows
  • Get data from XML
  • Get File Names
  • Get Files Rows Count
  • Get Repository Names
  • Get SubFolder Names
  • Get System Info
  • Get Table Names
  • Google Analytics
  • Google Docs Input
  • GZIP CSV Input
  • HL7 Input
  • IBM Websphere MQ Consumer
  • JMS Consumer
  • Json Input
  • LDAP Input
  • LDIF Input
  • Microsoft Access Input
  • Microsoft Excel Input
  • Mondrian Input
  • OLAP Input
  • Property Input
  • RSS Input
  • S3 CSV Input
  • Salesforce Input
  • SAP Input
  • SAS Input
  • Table Input
  • Text file Input
  • XBase Input
  • XML Input Stram(StAX)
  • Yaml Input
Output
  • Automatic Documentation Output
  • Delete
  • IBM Websphere MQ Producer
  • Insert / Update
  • JMS Producer
  • Json Output
  • LDAP Output
  • Microsoft Access Output
  • Microsoft Excel Output
  • Microsoft Excel Writer
  • Pentaho Reporting Output
  • Properties Output
  • RSS Output
  • S3 File Output
  • Salesforce Delete
  • Salesforce Insert
  • Salesforce Update
  • Salesforce Upsert
  • Serialize to File
  • SQL File Output
  • Synchronize after merge
  • Table Output
  • Text File Output
  • Update
Transform
  • Add a Checksum
  • Add Constants
  • Add Sequence
  • Add Value Fields Changing Sequence
  • Add XML
  • Calculator
  • Closure Generator
  • Concat Fields
  • Example Plugin
  • Get ID from Slave Server
  • Number Range
  • Replace in String
  • Row Denormaliser
  • Row Flattener
  • Row Normaliser
  • Select Values
  • Set Field Value
  • Set Field Value to Constant
  • Sort Rows
  • Split Field to Rows
  • Split Fields
  • String Operations
  • Strings Cut
  • Unique Rows
  • Unique Rows(HashSet)
  • Value Mapper
  • XSL Transformation
Utility
  • Clone Row
  • Edi to XML
  • Execute a Process
  • If Field Value is Null
  • Mail
  • Metadata Structure of Stream
  • Null If...
  • Process Files
  • Run SSH commands
  • Send Message to Syslog
  • Table Compare
  • Zip File
Flow
  • Abort
  • Append Streans
  • Block this Step Until Steps Finish
  • Block Step
  • Detect Empty Stream
  • Dummy(Do Nothing)
  • ETL Metadata Injection
  • Filter Rows
  • Identify Last Row in a Stream
  • Java Filter
  • Job Executor
  • Prioritize Streams
  • Single Threader
  • Switch / Case
  • Transformation Executor
Scripting
  • Excuete Row SQL Script
  • Execute SQL Script
  • Formula
  • Modified Java Script Value
  • Regex Evaluation
  • Rule Accumulator
  • Rule Executor
  • User Defined Java Class
  • User Defined Java Expression
Lookup
  • Call DB Procedure
  • Check If a Column Exists
  • Check If File is Locked
  • Check If WebService is available
  • Database Join
  • Database Lookup
  • Dynamic SQL Row
  • File exists
  • Fuzzy match
  • HTTP Client
  • HTTP Post
  • REST Client
  • Stream Lookup
  • Table exists
  • Web Services Lookup
Joins
  • Join Rows(Cartesian Proceduct)
  • Merge Join
  • Merge Row(Diff)
  • Sorted Merge
  • XML Join
Data Warehouse
  • Combination Lookup/Update
  • Dimension Lookup/Update
Validation
  • Credit Card Validator
  • Data Validator
  • Mail Validator
  • XSD Validator
Statistics
  • Analytic Query
  • Group by
  • Memory Group by
  • Output Steps Metrics
  • Reservoir Sampling
  • Sample Rows
  • Univariate Statistics
Data Mining
  • Arff Output
  • Knowldge Flow
  • Weka Scoring
Big Data
  • Avro Input
  • Cassandra Input
  • Cassandra Output
  • CouchDb Input
  • Hadoop File Input
  • Hadoop File Output
  • HBase Input
  • HBase Output
  • MapReduce Input
  • MapReduce Output
  • MongoDB Input
  • MongoDB Output
  • Splunk Input
  • Splunk Output
  • SSTable Output
Agile
  • MonetDB Agile Mart
  • Table Agile Mart
Cryptography
  • PGP Decrypt Stream
  • PGP Encrypt Stream
  • Secret Key Genrator
  • Symmetric Cryptography
Palo
  • Palo Cell Input
  • Palo Cell Output
  • Palo Dim Input
  • Palo Dim Output
OpenERP
  • OpenERP Object Delete
  • OpenERP Object Input
  • OpenERP Object Output
Job
  • Copy Rows to result
  • Get Files from result
  • Get Rows from result
  • Get Variables
  • Set Files in result
  • Set Variables
Mapping
  • Mapping(sub-transformation)
  • Mapping Input Specification
  • Mapping Output Specification
  • Simple Mapping(sub-transformation)
Bulk Loading
  • ElasticSearch Bulk Insert
  • Infobright Loader
  • Ingres VectorWise Bulk Loader
  • LucidDB Streaming Loader
  • MonetDB Bulk Loader
  • MySQL Bulk Loader
  • Oracle Bulk Loader
  • PostgresSQL Bulk Loader
  • Teradata Fastload Bulk Loader
  • Vertica Bulk Loader
Inline
  • Injector
  • Socket Reader
  • Socket Writer
Experimental
  • Script
  • SFTP Put
Deprecated
  • Aggregate Rows
  • Get Previous Row Fields
  • Google Analytics Input
  • LucidDB Bulk Loader
  • Streaming XML Input
History
  • Text File Input
  • Modified Java Script Value
  • Table Input
  • Text File Output
  • Get System Info
  • Sort Rows
  • Sorted Merge
  • Generate Rows
  • Merge Rows(diff)