ETL With Apache Airflow
Outline of the Project
with open(filename.json, "r") as read_file:
data = json.load(read_file)
data
is of type dictionary. It’s wonderful.
Outline of the Project
I’ve recently added somemissing .SCSS files tomy websites github repository, which will now let write my own theme. I’ve taken inspiration from flower arrang...
Since graduating high school six years ago, I’ve always felt like my long-term memory and ability to meaningfully learn new concepts has been declining. Anec...
What I did: Monday: Kaggle work with giant spotify dataset Tuesday: Added use of the Spotify API, spotipy to the notebook. Weds - Sat: Added recommend...
One-shot transfer learning with regression
I created a really nice template for ML projects that I hope BNL will continue to use for a long time.
What I did Probability Problems
What I did Monday: Fixed MSD Values of particle averaged structure. I was double counting some bonds before. Tuesday: Neural Network now predicts 5 feat...
What I did Sunday: Trained the neural network on the skew-norm dataset and spent some time playing around with the architecture and hyperparameters. Mon...
Examples of using pandas .pipe()
Hurst Oxponent, Time Series, etc.
What I did Monday: Energy mesh interpolation codes Tuesday - Thursday: Felt dead from the seconds shot of the Moderna covid vaccine. Didn’t do much. I’v...
What I did Monday: Finished script to generate thousands of real-disordered structures. Tuesday: Wrote a script to loop through the results from the sim...
What I did Monday: NN data-loading Tuesday: NN pre-processing Wednesday: NN pause. I need to test that my gaussian averaging method is valid enough to...
What I did Sunday: Thesis chapter \(\LaTeX\) writing Monday: Thesis chapter \(\LaTeX\) writing Tuesday: More writing, but I also redid some figures co...
Skewnorms and comparing simulations to experiment data
Website building, mathematica rendering, and lots of seaborn manipulation
What I did Monday: Got FEFF to work. Read paper: “Probing Atomic Distributions in Mono- and Bimetallic Nanoparticles by Supervised Machine Learning” Nano ...
What I did Monday: Train ride back to Boston. Wrote python script to generate .inp files with different radially dispalced atoms Tuesday: Cleaned up the...
What I did Monday: Worked on jupyter notebook. See plotly plot from last week Tuesday: Feff9 on linux debugging all day conference call. Wednesday: Pr...
What I did Sunday: XAFS Thesis writing Monday: Traveling Tuesday: FEFF Codes and disorder in XANES paper reading Wednesday: 2010 Phys Review B, “Eff...
This blog will now partly shift from what I learned as a data scientist to what I’ve learned as a scientist, to keep track of my thesis progress. Of course s...
<!– — title: “On the Importance of Data Literacy in Decision Making” date: ‘2021-02-10’ categories: miscellaneous tags: statistics
I’m getting close to my final exams so most of my time will go towards studying. Accordingly, expect a hiatus from blog postings.
Bayesian analysis of Amazon reviews in R and the most important functions in Pandas
GRU’s, strings, and batch normalization. I didn’t post anything last week because I was busy with grad school work and traveling – I flew back home to New Yo...
Homo and heteroskedasticity. When to use NumPy vs. Pandas.
R equivalencies in Python.
Decision trees are one of the first data structures you learn in computer science (graphically, in practice you might learn them as a linked list), but you m...
Probability and statistics riddles
Stats riddles, Covariance Matrix, etc.
SVD and Eigendecomposition, XGBoost, etc.
SQL, Naive Bayes, Python tips
SQL, precision, recall, and fun with Numpy
Original Paper
Note, last week is missing. Most of what I did is included in the prior week, and the rest is included here. It wasn’t enough to merit its own week, because ...
Sensitivity, CSV reading, and Pandas
KRR, SVD, and PCA
I’ve decided to combine my weekly’s for these two weeks because I’ve had to spend a lot of time traveling (Germany->Strasbourg->Paris->Montpellier) ...
What I did
What I did
Sigmoid
What I did
What I did
What I did
What I did
What I did
What I did
What I did
What I did
Pytorch uses tensors instead, similar to a numpy array
What I did
RMSprop is similar to gradient descent with momentum, but takes into consideration the sign of the gradient like this:
In order to implement a neural network, a solid understanding of Linear Algebra is needed for the forward pass, whereas vector calculus is needed for backpro...
What I did
What I did
In my previous miscellaneous post, I wrote about why I gave up on WSL1 and switched to running ubuntu in a virtual machine. After looking into WSL2 more, I c...
Update: Using a virtual machine has not been as smooth as I expected. I’ve ended up with some weird problems after a few days. For example: sometimes my trac...
What I did
This is my first post, and I would like to explain my plan for this blog. During my summer work with my undergraduate thesis advisor, I would have a weekly m...