More

autodidacticon · 2025-01-26T06:00:42 1737871242

What is NDV partitioning?

artwr · 2025-01-26T06:47:33 1737874053

NDV = Number of distinct values. Here partitioning on high cardinality columns, essentially.

autodidacticon · on May 7, 2021

It would seem that the world of data engineering is largely dominated by Java.

_hzrk · on May 8, 2021

I share the same view, based on the jobs in my area, everyone is using Python to prototype and Java/Scala to productionize, I've also seen Node.js increasingly being used, probably for things that are I/O-intensive and not compute-bounded. Rust could change the landscape for Node/Deno, critical code which is memory/cpu-intensive could be exported to a WASM library or an external dependency that can be called through FFI (though that would be very slow and you introduce the 2-language problem that Julia is trying to fix for Python). Anyway, even if we are in a way better state today than, let's say, 5 years ago when it comes to picking tools for data engineering, I strongly believe that JVM is here to say for a long, long time.

autodidacticon · on April 30, 2021

but have found that only companies at the largest scale want to pursue the distsys approaches that I'm most interested in (e.g., gossip/CRDTs, consensus algorithms, etc.).

Can this be disproven? Are there companies of smaller scale that are attempting to solve these problems?

ta473589 · on April 30, 2021

I would also say, having worked a company of smaller scale trying to solve distsys problems, that they're hard to sell. But the distsys skill set does seem (to me) marketable at the large cloud providers.

ta473589 · on April 30, 2021

There are definitely companies at smaller scale that work on these problems. I would say that the only time it really feels solved to me is if the scale is very large.

autodidacticon · on Oct 23, 2020

| I'm as big of a Satya fan as anyone, but someone who doesn't get enough credit is Ballmer.

Agreed. Ballmer's tenure is widely misunderstood. Gates does not receive enough blame for the missteps of that era.

Great read: https://www.vanityfair.com/news/business/2012/08/microsoft-l...

autodidacticon · on Oct 14, 2019

Online courses may be helpful, but I recommend applying for a junior development position and seeking immediate employment. Professional experience will impart far more technical knowledge than coursework.

opendomain · on Oct 14, 2019

I am sorry - I can not agree with this. A LOT of people think that they want to be developers, but find out it can be very frustrating.

I suggest taking a few courses first to see if you like development.

folkhack · on Oct 14, 2019

I'm not in computational work as a dev, but I would argue that coursework and the how things work in the professional world are two different ballgames. Jumping into the deep-end isn't always bad advice =)

I also agree with you at the same time. Just playing devil's advocate!

autodidacticon · on Feb 6, 2019

The Origin of Consciousness in the Breakdown of the Bicameral Mind - Julian Jaynes

Ancient human's may have more in common with schizophrenics of today than history would indicate. Religion may be a result thereof.

A People's History of the United States - Howard Zinn

US History can be considered a function of large scale popular movements rather than decisions made by individuals (leaders).

Gun's Germs & Steel - Jared Diamond

Civilizations' success can be considered a function of base resources (edible grains, farmland, metals).

autodidacticon · on July 27, 2017

Standing desk. Skip lunch. Green smoothies.

autodidacticon · on Jan 1, 2017

You must expend enough effort that you feel some “yuck.”

Perform the tasks that you would otherwise google. Good luck, half fun.

autodidacticon · on Nov 21, 2016

1. Install Duolingo on your phone 2. Change your phone's microphone default language to Spanish 3. Practice by dictating spanish directly to your phone without typing

autodidacticon · on Aug 18, 2016