Scott Stoltzman's Avatar

Scott Stoltzman

@stoltzmaniac

You’re the data guy right? Yeah! Can you fix my printer?

43
Followers
179
Following
44
Posts
31.01.2024
Joined
Posts Following

Latest posts by Scott Stoltzman @stoltzmaniac

Definitely avoid using renv inside your project, computer immediately overheats

09.01.2025 02:18 👍 1 🔁 0 💬 0 📌 0
Post image

And look at those great front-end chops. Beautiful Responsive design #css

08.01.2025 01:51 👍 1 🔁 0 💬 0 📌 0

Before I did any filtering, the biggest thing I noticed was that the most popular (not inappropriate tags) were always #art #photo / #photography

Thought that was pretty cool to see.

07.01.2025 23:12 👍 0 🔁 0 💬 0 📌 0
Video thumbnail

I have to say, the #bluesky API is really fun to play with. Websockets are definitely less intimidating than you think. 3 files to make this page below #javascript #html #css

Track whatever specific words you want, it tallies up the hashtags in the posts.

tinyurl.com/hashtagexample

07.01.2025 23:05 👍 4 🔁 0 💬 2 📌 0

Would definitely use something like this

04.01.2025 23:32 👍 2 🔁 0 💬 0 📌 0

@econmaett.bsky.social - Any desire for read access to the hashtags located on Azure to do some #dataviz or #rtstats? DM for temporary read-only access. #duckdb integrates seamlessly. Data in parquet:
did (user ID), cid (post ID), hashtag, created_at, text

ALL hashtag posts being stored.

02.01.2025 14:51 👍 0 🔁 0 💬 0 📌 0

python-dotenv

seriously... someone needs to give up dotenv

02.01.2025 02:26 👍 1 🔁 0 💬 0 📌 0

Added some extra spam filtering and filters to help keep the data more accurate.

02.01.2025 02:19 👍 0 🔁 0 💬 1 📌 0

Looks like the bot isn’t posting to the same thread, nor is it resetting. Haven’t looked since Christmas. Looks like these hashtags are stabilizing in the top 3, same with combos. Will need to modify the script and swap it to 12 hours instead of 6.

28.12.2024 22:27 👍 0 🔁 0 💬 0 📌 0
Post image
23.12.2024 21:34 👍 0 🔁 0 💬 0 📌 0

Each tag (yellow) should grow relationships (USED_TAG) to users (green). #sql should link this account.

23.12.2024 21:30 👍 0 🔁 0 💬 1 📌 0
Post image

I'll be working on the visualization over the holidays to embed network diagrams for the neo4j implementation of @stoltzmaniacbot.bsky.social programming hashtag bot

23.12.2024 20:26 👍 0 🔁 0 💬 1 📌 0

Dang it, thought that it wouldn’t “tag” unless it turned it blue, will cut this down to once per day until I figure out how to make it stop. Also, #rstats made an overnight comeback! Scott Bot will be modified tonight, feel free to suggest anything

23.12.2024 13:34 👍 0 🔁 0 💬 0 📌 0

Happy the hashtag bot doesn’t actually hashtag. Table needs prettying up. I was convinced the R community dominated the programming hashtag world. But maybe it’s just the weekend/holiday slump

23.12.2024 00:43 👍 0 🔁 0 💬 0 📌 0

Whoa, sorry for the bot testing spam! Gonna ruin another hashtag for testing next time

21.12.2024 22:03 👍 1 🔁 0 💬 0 📌 0

#javascript only. 4 life

21.12.2024 21:04 👍 0 🔁 0 💬 1 📌 0
Post image

Looks like #python has the best rating for mutual posting with mentions tagged. #golang coming in last
Total nodes: 129k

#python = 19%
#rstats = 10%
#golang = 4%

Sampling method is questionable:)

19.12.2024 19:03 👍 1 🔁 0 💬 0 📌 0
Post image

Also, run fact @mackaszechno.bsky.social , you were in my recommendations list!

19.12.2024 18:12 👍 0 🔁 0 💬 1 📌 0

Yeah, took me a while to pick. It seemed as if the OSS + examples + documentation lowered the barrier to entry. First time using actual graph database rather than the in memory packages reading from CSVs

19.12.2024 18:10 👍 1 🔁 0 💬 1 📌 0

I’d love to share easy access to this #neo4j database for querying. Any ideas of the simplest way to do so without simply handing out user/pass?

19.12.2024 16:23 👍 0 🔁 0 💬 0 📌 0
Post image Post image Post image Post image

Scraped #python and #rstats from the latest package I built to take bsky data directly into #neo4j to make recommendations for follows directly out of a cypher query. Impressed by the speed! Going to have to start following some new people! I live in a small bubble in the graph visualization.

19.12.2024 16:22 👍 4 🔁 1 💬 3 📌 0
Post image Post image

Made a couple of updates to #bskydata to simplify imports. Also, considering adding a "write to network" function. What is the consensus on the best network visualization library? Here is pyvis - ChatGPT defaults, didn't do any customization.

pip install bskydata

#python #pyvis

16.12.2024 23:41 👍 0 🔁 0 💬 0 📌 0

Namespace related stuff always makes for a good R-ant.

R 4.0+ has made some improvements. hoping that modules get introduced somehow (seems impossible) rather than having to continue making new names for every single task

16.12.2024 13:33 👍 1 🔁 0 💬 0 📌 0

Don’t people realize that there’s a reason “Jupyter” rather than “Stataturn” notebooks became major tools?

#rstats #stata #python #julia

15.12.2024 15:10 👍 1 🔁 0 💬 0 📌 0

Once you see it, you can’t unsee it. Very cool!

14.12.2024 01:09 👍 0 🔁 0 💬 0 📌 0
Post image Post image Post image

BREAKING: #python bskydata now has parser functionality so that data can be consistent, clean, and customized. No more full JSON dumps unless you need it. This will save a lot of storage space.

pip install bskydata

followers.json, follows.json, search_terms.json shown.

13.12.2024 16:16 👍 0 🔁 0 💬 0 📌 0
Post image

BREAKING: #python bskydata now scrapes and writes data directly to #mongodb -- I need some help on how it should insert, but it does connect and insert records! Simply switch out one line, your DataWriter is now MongoDBDataWriter

pip install bskydata[mongodb]

12.12.2024 22:55 👍 0 🔁 0 💬 1 📌 0

Also, @brianokken.bsky.social - guessing you may have some knowledge about what the best testing resources are. I want to write tests but since it's mostly just moving data around from APIs, it feels incredibly difficult to understand if things are working properly. Any mocking strategy you like?

12.12.2024 20:41 👍 0 🔁 0 💬 1 📌 0
Post image Post image Post image Post image

BREAKING: #python BlueSky data scraping with auto-pagination now writes directly to the cloud! Integrations with #azure #aws #gcp have been completed. Simply swap out AzureDataWriter, GCPDataWriter, S3DataWriter in your scraper.

pip install [azure, aws, gcp]

Examples:
github.com/stoltzmaniac...

12.12.2024 19:39 👍 1 🔁 0 💬 1 📌 0
Post image

BREAKING: BlueSky data scraping is now easier. New update to #python bskydata package allowing you to search and store #bluesky posts. Auto-pagination & automated local file storage. Built for simplicity.

pip install bskydata

github.com/stoltzmaniac...

12.12.2024 04:17 👍 0 🔁 0 💬 1 📌 0