Contents
This sample demonstrates the use of the Spotfire Streaming Binary File Reader and Writer Adapters for Apache HDFS. HDFS is the well-known acronym for Apache's Hadoop Distributed File System.
In StreamBase Studio, import this sample with the following steps:
-
From the top-level menu, click
> . -
Enter
binary
to narrow the list of options. -
Select HDFS binary file input/output adapters from the Large Data Storage and Analysis category.
-
Click
.
StreamBase Studio creates a project for this sample.
-
In the Project Explorer view, open the sample you just loaded.
If you see red marks on a project folder, wait a moment for the project to load its features.
If the red marks do not resolve themselves after a minute, select the project, right-click, and select
> from the context menu. -
Open the
src/main/eventflow/
folder.packageName
-
Open the
binaryrw.sbapp
file and click the Run button. This opens the SB Test/Debug perspective and starts the module. -
In the Project Explorer view, open the
src/main/resources
folder and double-clickmyfile.csv
to see the CSV records that will be converted to binary. -
From the SB Test/Debug perspective, in the Manual Input view, select the DoWrite input stream and click
. -
Look for tuples emitted on the BinaryDataWritten output stream and observe they contain the values from
myfile.csv
. -
Select the DoRead input stream and click
. -
Look for tuples emitted on the BinaryDataRead output stream and observe they contain the values from
myfile.csv
. -
Select the DoDelete input stream and click
. -
Look for tuples emitted on the DeleteStatus output stream to confirm the file was deleted.
-
When done, press F9 or click the Terminate EventFlow Fragment button.
When you load the sample into StreamBase® Studio, Studio copies the sample project's files to your Studio workspace, which is normally part of your home directory, with full access rights.
Important
Load this sample in StreamBase® Studio, and thereafter use the Studio workspace copy of the sample to run and test it, even when running from the command prompt.
Using the workspace copy of the sample avoids permission problems. The default workspace location for this sample is:
studio-workspace
/sample_hdfsbinaryrw
See Default Installation
Directories for the default location of studio-workspace
on your system.