
display spark datasets and dataframes with table widget #7040

Closed
scottdraves opened this issue Mar 25, 2018 · 5 comments
Comments


scottdraves commented Mar 25, 2018

See @jpallas's comments in #6993.

Indeed. The widget should show just what Spark normally shows: ASCII-formatted tables (sometimes very wide).

Is there a way to configure the show method to use our repr? Maybe via https://github.com/jupyter/jvm-repr? Or should we add a new method or a subclass?

After the tables support streaming (#7006), we can connect them directly to that.
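For reference, Spark's show() prints a plain-text grid. The tiny stand-in below (not Spark's implementation; the name and alignment rules are my own) mimics that layout, just to make concrete what the table widget would replace:

```scala
// Illustration only: mimics the ASCII-grid layout that Dataset.show() prints.
// Spark's real formatter differs in details (alignment, truncation to 20 chars).
object AsciiTableSketch {
  def format(columns: Seq[String], rows: Seq[Seq[Any]]): String = {
    val cells = rows.map(_.map(_.toString))
    // each column is as wide as its widest entry, header included
    val widths = columns.indices.map { i =>
      (columns(i).length +: cells.map(_(i).length)).max
    }
    val sep = widths.map("-" * _).mkString("+", "+", "+")
    def line(vs: Seq[String]) =
      vs.zip(widths).map { case (v, w) => v.padTo(w, ' ') }.mkString("|", "|", "|")
    (Seq(sep, line(columns), sep) ++ cells.map(line) :+ sep).mkString("\n")
  }
}
```

For example, `AsciiTableSketch.format(Seq("first", "second"), Seq(Seq("a", 1)))` yields a `+-----+------+`-bordered grid like show()'s output.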

@scottdraves changed the title from “display spark datasets with table widget” to “display spark datasets and dataframes with table widget” on Mar 25, 2018

@scottdraves (Author)

Related: #7041


jpallas commented Mar 25, 2018

show is defined directly on Dataset and has no return value, so I don't think there's any way to get its output into a table short of scraping the printed text (or doing unspeakable things with cglib).


scottdraves commented Mar 26, 2018

Yeah, it should be a display handler installed via jvm-repr.
Maybe start with 1000 rows.
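A minimal sketch of the registration pattern this suggests. To keep it self-contained, Displayer and Displayers below are stand-ins with the same shape as jvm-repr's jupyter.Displayer / jupyter.Displayers (register a handler per class, return a MIME-type-to-representation map), and FakeDataset stands in for org.apache.spark.sql.Dataset; a real handler would emit whatever bundle the table widget consumes:

```scala
import scala.collection.mutable

// Stand-in for jvm-repr's jupyter.Displayer: object -> MIME bundle.
trait Displayer[T] {
  def display(obj: T): Map[String, String]
}

// Stand-in for jvm-repr's jupyter.Displayers registry.
object Displayers {
  private val registry = mutable.Map.empty[Class[_], Displayer[_]]

  def register[T](cls: Class[T], d: Displayer[T]): Unit = registry(cls) = d

  def display(obj: Any): Map[String, String] =
    registry.collectFirst {
      case (cls, d) if cls.isInstance(obj) =>
        d.asInstanceOf[Displayer[Any]].display(obj)
    }.getOrElse(Map("text/plain" -> obj.toString))
}

// Stand-in for a Spark Dataset: columns and rows already on the driver.
final case class FakeDataset(columns: Seq[String], rows: Seq[Seq[Any]])

object DatasetDisplayer {
  val MaxRows = 1000 // "maybe start with 1000 rows"

  def register(): Unit =
    Displayers.register(classOf[FakeDataset], new Displayer[FakeDataset] {
      def display(ds: FakeDataset): Map[String, String] = {
        val shown = ds.rows.take(MaxRows)
        // Placeholder representation; a real handler would target the widget.
        Map("text/plain" -> s"${ds.columns.mkString(", ")} (${shown.size} rows)")
      }
    })
}
```

After DatasetDisplayer.register(), Displayers.display(someDataset) routes through the registered handler instead of the default toString.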


scottdraves commented Mar 28, 2018

@jpallas, on the other issue you said:

> (I've prototyped this).

Care to share? Jarek could pick up your work if you think it's on the right track.


jpallas commented Mar 28, 2018

What I did is just a few lines:

```scala
implicit class DatasetOps(ds: org.apache.spark.sql.Dataset[_]) {
  def display(rows: Int = 20): Unit = {
    // I do not understand why this import is necessary
    import com.twosigma.beakerx.scala.table.TableDisplay

    val columns = ds.columns          // column names, in order
    val rowVals = ds.toDF.take(rows)  // collect at most `rows` rows to the driver
    // build one Map(column name -> cell value) per row for the table widget
    val t = new TableDisplay(rowVals map (row => (columns zip row.toSeq).toMap))
    t.display()
  }
}
```

(There is something strange going on with imports and visibility in the Scala kernel, maybe some odd interaction with the way the interpreter wraps things.)

Example with a Dataset[Fields], where case class Fields(first: String, second: String, third: Int):
[screenshot: table widget rendering the Dataset, 2018-03-28]
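To make the per-row conversion in the snippet above concrete: it zips the column names with the row's values. A plain Seq[Any] stands in for org.apache.spark.sql.Row here, and rowToMap is a hypothetical helper name, using the Fields example:

```scala
// Plain-Scala illustration of the (columns zip row.toSeq).toMap step above;
// a Seq[Any] stands in for org.apache.spark.sql.Row.
case class Fields(first: String, second: String, third: Int)

def rowToMap(columns: Array[String], rowValues: Seq[Any]): Map[String, Any] =
  (columns zip rowValues).toMap // column name -> cell value

val asMap = rowToMap(Array("first", "second", "third"),
                     Fields("a", "b", 3).productIterator.toSeq)
// asMap == Map("first" -> "a", "second" -> "b", "third" -> 3)
```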
