HDFS Binary File Reader/Writer Adapter Sample

About This Sample

This sample demonstrates the use of the TIBCO StreamBase® Binary File Reader and Writer Adapters for Apache HDFS. HDFS is the well-known acronym for Apache's Hadoop Distributed File System.

Initial Setup

You must open the sample application, binaryrw.sbapp and select the Parameters tab and edit the HDFS_FILE_PATH and HDFS_USER to represent:

  • your current HDFS setup

  • where you would like to store the sample data

Importing This Sample into StreamBase Studio

In StreamBase Studio, import this sample with the following steps:

  • From the top menu, click FileLoad StreamBase Sample.

  • Type hdfs to narrow the list of options.

  • Select hdfsbinaryrw from the Large Data Storage and Analysis category.

  • Click OK.

StreamBase Studio creates a project for this sample.

Running binaryrw.sbapp Sample in StreamBase Studio

  1. In the Project Explorer, open the sample you just loaded.

  2. Open the src/main/eventflow folder.

  3. Open the package folder (most samples contain a single package folder. Open the top-level package folder if your sample contains more than one folder).

  4. Open the named application and click the Run button. This opens the SB Test/Debug perspective and starts the application.

    If you see red marks, wait a moment for the project in Studio to load its features.

    If red marks do not resolve themselves in a moment, select the project and right-click MavenUpdate Project from the context menu.

  5. In the Project Explorer view, open the src/main/resources folder and double-click myfile.csv to see the CSV records that will be converted to binary.

  6. From the SB Test/Debug perspective, in the Manual Input view, select the DoWrite input stream and click Send Data.

  7. Look for tuples emitted on the BinaryDataWritten output stream and observe they contain the values from myfile.csv.

  8. Select the DoRead input stream and click Send Data.

  9. Look for tuples emitted on the BinaryDataRead output stream and observe they contain the values from myfile.csv.

  10. Select the DoDelete input stream and click Send Data.

  11. Look for tuples emitted on the DeleteStatus output stream to confirm the file was deleted.

  12. When done, press F9 or click the Stop Running Application button.

Sample Location

When you load the sample into StreamBase Studio, Studio copies the sample project's files to your Studio workspace, which is normally part of your home directory, with full access rights.


Load this sample in StreamBase Studio, and thereafter use the Studio workspace copy of the sample to run and test it, even when running from the command prompt.

Using the workspace copy of the sample avoids permission problems. The default workspace location for this sample is:


See Default Installation Directories for the default location of studio-workspace on your system.