Posts

Showing posts from July, 2024

Text file line shuffling in Java

Image
This program is an elementary tutorial in Java illustrating basic syntax, subroutines and basic file I/O. Shuffling the lines of a text file is a very important step when preparing .csv training datasets for Machine Learning models. Java was originally created for rich web applications (see 'applets') and embedded programming, but it quickly became a general-purpose language with seminal cross-platform deployability due to the JVM design. In this example, legacy Java syntax is used for illustrating coding with the very first release (JDK 1.0) back in 1995. This means no support for generics, i.e., only 'Object'-based containers (Vector) and extremely limited API compared to recent versions. And of course no syntax highlighting or code completion features, which where not available in terminals with 'vi' and command-line compiler as the core development platform. Of course, the source can be compiled with today's Java 19 compiler just fine, with the -Xlint:u

Week01: Kick off - "And so it begins..."

Image
In every writing endeavor, the most difficult part is the beginning. This first week is allocated mostly to setting up things, drafting the Preface and Prologue. A lot of notes and references to organize. #DSbook https://apneacoding.blogspot.com/2024/07/announcement-writing-new-book-in-data.html    

Announcement: Writing a new book in Data Science and R&D projects

Image
About two years ago an idea was hovering through my mind, a question actually: Why is that “traditional” project managers fail so gloriously in managing teams, especially in the Research and Development (R&D) field of work? I let it mature for a while, thinking that someone will eventually write a book or something about it. Two years have passed and no one did it, so I figured, well, I guess I will have to do that myself. After sending the proposal to only few top publishers, I was privileged enough to land a book contract with Elsevier , which was indeed my first choice for this endeavor.  The subject of the book will be simple: Managing and coordinating teams in the R&D environment, specifically in the domains of Data Analytics, Machine Learning and Artificial Intelligence in general , where I have been working for more than two and a half decades as a professional. Specifically, it will focus on every aspect of teamwork from the first to the last day of a R&D project, i

Channel Updates