Massmine

From UFRC
Revision as of 20:00, 3 December 2014 by Magitz (talk | contribs)
Jump to navigation Jump to search

Description

MassMine website  

MassMine is a social media mining and archiving application that simplifies the process of collecting and managing large amounts of data across multiple sources. It is designed with the researcher in mind, providing a flexible framework for tackling individualized research needs. MassMine is designed to run both on personal computers and dedicated servers/clusters. MassMine handles credential authorizations, data acquisition & archiving, as well as customized data export and analysis.


Additional Information

Installing MassMine at UF Research Computing

You will need:

  • UF Research Computing Account: apply here.
  • A Twitter account
  • An SSH client:
    • Mac: Terminal is installed at /Applications/Utilities/Terminal
    • Windows: Most users use PuTTY
  • An SFTP client:

Connecting to Research Computing

The best host for running MassMine is daemon1.rc.ufl.edu. There is some additional information about this host Daemons here. If you have a Mac or Linux machine, in the Terminal application, type:

ssh username@daemon1.rc.ufl.edu

Use your username, and enter your password when prompted. Note that when you type your password, nothing will appear.

If you have a Windows machine, open PuTTY and enter username@daemon1.rc.ufl.edu in the Hostname box of the window and click Open. Enter your password when prompted, again nothing will appear when you type your password.

Downloading MassMine

MassMine is hosted on GitHub and is under active development. At this time we are suggesting that each user download their own copy of MassMine and update it periodically. We recommend installing MassMine in your /scratch/lfs directory. The following is an example of the steps needed to download MassMine. You should use your username where you see "magitz", that is my username.

[magitz@daemon1 ~]$ cd /scratch/lfs/magitz/
[magitz@daemon1 magitz]$ git clone https://github.com/n3mo/massmine.git
Initialized empty Git repository in /scratch/lfs/magitz/massmine/.git/
remote: Counting objects: 199, done.
remote: Total 199 (delta 0), reused 0 (delta 0)
Receiving objects: 100% (199/199), 147.03 KiB, done.
Resolving deltas: 100% (106/106), done.
[magitz@daemon1 magitz]$ cd massmine/
[magitz@daemon1 massmine]$ module load R
[magitz@daemon1 massmine]$ ./massmine 


Note the "module load R" command. This gets the system ready to run R. The module system is not a standard Linux feature, but greatly facilitates running applications at Research Computing. Learn more about [Modules modules here].