2021년/Data
koalas
오늘은 코알라스를 알아보려한다. Apache Spark위에 Pandas API를 구현한 기능이다. But when they have to work with really large data they don’t have option they have to migrate to PySpark due to scalability issue in Pandas. In-Order to solve this problem Data-bricks introduced a solution called “Koalas” a library where you can transfer your data between Pandas and PySpark very easily without changing nearly ~75% of your na..
2021. 3. 23. 19:37