About 8,460,000 results
Open links in new tab
  1. Reading / Fixing a corrupt parquet file - Stack Overflow

    Sep 3, 2024 · Reading / Fixing a corrupt parquet file Asked 1 year, 1 month ago Modified 5 months ago Viewed 2k times

  2. Inspect Parquet from command line - Stack Overflow

    How do I inspect the content of a Parquet file from the command line? The only option I see now is $ hadoop fs -get my-path local-file $ parquet-tools head local-file | less I would like to avoid

  3. Is it possible to read parquet files in chunks? - Stack Overflow

    Nov 29, 2019 · The Parquet format stores the data in chunks, but there isn't a documented way to read in it chunks like read_csv. Is there a way to read parquet files in chunks?

  4. How do I get schema / column names from parquet file?

    Nov 24, 2015 · Also, Cloudera (which supports and contributes heavily to Parquet) has a nice page with examples on usage of hangxie's parquet-tools. An example from that page for your …

  5. indexing - Index in Parquet - Stack Overflow

    Basically Parquet has added two new structures in parquet layout - Column Index and Offset Index. Below is a more detailed technical explanation what it solves and how. Problem …

  6. How to read a Parquet file into Pandas DataFrame?

    How to read a modestly sized Parquet data-set into an in-memory Pandas DataFrame without setting up a cluster computing infrastructure such as Hadoop or Spark? This is only a …

  7. Extension of Apache parquet files, is it '.pqt' or '.parquet'?

    Oct 19, 2021 · 41 I wonder if there is a consensus regarding the extension of parquet files. I have seen a shorter .pqt extension, which has typical 3-letters (like in csv, tsv, txt, etc) and then …

  8. Spark parquet partitioning : Large number of files

    Jun 28, 2017 · I am trying to leverage spark partitioning. I was trying to do something like data.write.partitionBy ("key").parquet ("/location") The issue here each partition creates huge …

  9. Python: save pandas data frame to parquet file - Stack Overflow

    Dec 9, 2016 · Is it possible to save a pandas data frame directly to a parquet file? If not, what would be the suggested process? The aim is to be able to send the parquet file to another …

  10. .net - How to load parquet files into Power BI desktop …

    Jan 3, 2023 · How to load parquet files into Power BI desktop (ParquetSharp) error Asked 2 years, 10 months ago Modified 2 years, 10 months ago Viewed 3k times